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NUCLEIC ACIDS AND PROTEINS OF A RAT GANGLIOSIDE GM,-SPECIFIC 
al-2FUCOS YLTRANSFERASE AND USES THEREOF 



This invention was made with government support under Research Grant 
5 CA70740 from the National Cancer Institute. The government has certain rights in the 
invention. 

1. FIELD OF THE INVENTION 

The present invention relates to a rat ganglioside GM,-specific 

10 a 1 -* 2fucosy ltransferase. The invention provides novel nucleotide sequences of a rat 

al^2mcosy ltransferase specific for a carbohydrate moiety found in ganglioside GM„ more 
particularly, specific for a terminal galactose (3 1 -» 3AT-acetylgalactosamine 
(Galpl-»3GalNAc) saccharide, amino acid sequences of its encoded protein (including 

^ peptide or polypeptide), and derivatives and analogs thereof. Merely for the ease of 
description, the enzyme is herein referred to as "GM,-specific" or "ganglioside GM,- 
specific". The invention also relates to fragments (and derivatives and analogs thereof) 
which comprise a domain of rat ganglioside GM,-specific a l-2fucosy ltransferase with 

20 catalytic activity. Methods of production of rat ganglioside GM r specific 

a 1 ->2fucosy ltransferase and derivatives and analogs thereof {e.g. by recombinant means) 
are provided. In addition, the invention relates to methods of inhibiting the function of rat 
ganglioside GM,-specific al-2fucosyltransferase {e.g. by means of antisense RNA). The 

25 invention further relates to use of rat ganglioside GM r specific al-2fucosy ltransferase in 
the preparative production of fucosyl-GM!. Applications of fucosyl-GM,, for example as an 
immunotherapeutic for cancer, are disclosed. 

2. BACKGROUND OF THE INVENTION 

30 Citation of a reference herein shall not be construed as an admission that 

such reference is prior art to the present invention. 

2.1. FUCOSYLTRANSFERASES 

Fucosyltransferases are enzymes that catalyze the addition of a fucose 
35 residue to a terminal galactose acceptor of saccharide precursors. Fucosyltransferase 
activity is involved in the production of oligosaccharides, glycolipids or glycoproteins. 
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There are four known classes of fucosyltransferases, namely those that catalyze the addition 
of fucose in al -2, al - 3, al -4 and al - 6 linkages. 

Fucosyltransferases are best known for their roles in the synthesis of the 
oligosaccharide moieties that comprise blood group antigenic determinants. For example, 
the fucosyltransferase encoded by the H gene catalyzes the transfer of fucose in an al -2 
linkage to the terminal galactose of Gal(pl-4)GlcNAc((31-3)Gal-R to produce 'H substance' 
on the surface of erythrocytes. Further addition of N-acetylgalactosamine or galactose leads 
to the formation of the type A or type B blood group substances respectively. An analogous 

j Q enzyme encoded by the Se locus catalyzes the formation of 'H substance' in epithelial 
tissues for secretion rather than presentation at the cell surface (Rosen et al., 1989, 
Dictionary of Immunology , Stockton Press, New York, pp. 1-3). 

Previous experiments with H35 hepatoma cell extracts demonstrated that 
transfer of fucose to neolacto-series acceptors occurred at a rate only 2% of that found for 

15 GM, (Holmes, E.H., et al, 1983, J. Biol. Chem, 258:3706-3713). This substrate specificity 
is more restricted compared to other cloned al ^fucosyltransferases but is most closely 
related to secretor-type enzymes (Larsen, R.D., et al., 1990, Proc. Natl. Acad. Sci. USA 
87:6674-6678; Kelly, R.J., et al., 1995, J. Biol. Chem. 270:4640-4649; Hitoshi, S., et al., 
1995, J. Biol. Chem. 270:8844-8850; Hitoshi, S., et al., 1996, J. Biol. Chem. 271:16975- 

20 16981). 



2.2. STRUCTURE OF al —2FUCOSYLTRANSFERASES 

To date, a number of genes encoding H-type and Se-type 
al ^fucosyltransferases have been cloned from several species of organisms. Three human 
al ^fucosyltransferases (Larsen et al., 1990, Biochemistry 87:6674-6678; Koda et al., 
1997, Eur. J. Biochem. 246:750-755; Kelly et al., 1995, J. Biol. Chem. 270:4640-4649), 
three rabbit al-*2fucosyltransferases (known as RFT-I, RFT-II and RFT-III) (Hitoshi et al., 
1995, J. Biol. Chem. 270:8844-8850; Hitoshi et al., 1996, J. Biol. Chem. 271:16975-19681), 
and two mouse al-*2fucosyltransferases (Tsuji, 1996, GenBank accession no. Y09882; Lin 
et al., 1998, GenBank accession no. AF064792) have been described. Piau et al. (1994, Eur. 
J. Biochem. 300:623-626) disclose fragments, designated FTA and FTB, of two rat 
al ^fucosyltransferases isolated from rat PROb colon adenocarcinoma cells. Piau et al. 
showed that antisense expression of the FTA or FTB nucleic acid fragments inhibited the 
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endogenous cd^2fiicosyltransferase activity of PROb cells with respect to the synthetic 
fucose acceptor phenyl P-D-galactopyranoside; however the FTB fragment was not shown 
to be sufficient for al- > 2fucosyltransferase catalytic activity, nor was the substrate 

5 specificity of the PROb a 1 - 2fucosyltransferase activity determined. 

H-type al-2fucosyltransferases are membrane localized whereas Se-type 
al-2fucosyltransferases are localized to the Golgi apparatus. Amino acid sequence 
alignment of membrane bound H-type al-*2fucosyltransferases reveals that, like other 

10 glycosyltransferases, there exists a homologous domain structure comprising a short 

intracellular N-terminal domain, a transmembrane domain, an extracellular stem region not 
required for enzymatic activity, and finally, the catalytic domain at the C-terminus. 
Generally, there is little sequence homology outside the catalytic domain. 

1 5 2.3. GANGLIOSIDE GM, AND ITS FUCOSYLATED DERIVATIVE 
FUCOSYL-GM , 

Gangliosides are cell surface constituents comprising glycosphingolipids 
(produced by the linking of ceramides to oligosaccharides) with sialic acid residues. 
Depending on the number of sialic acid residues they possess, gangliosides are known as 

20 mono-, di-, tri- or polysialogangliosides. GM, stands for ganglioside mono(sialic acid)_, . 

Fucosyl-GM,, detected by monoclonal antibodies, is found largely in the 
nervous system, and in particular on a subpopulation of neurons in the dorsal root ganglia 
and dorsal horn of the spinal cord, as well as on surrounding satellite cells surrounding the 
fucosyl-GM, positive neurons (Kusunoki et al., 1989, Brain Res. 494:391-395; Kusonoki et 

25 al., 1992,Neurosci. Res. 15: 74-80). 

Gangliosides have long been implicated in diseased states. They are often 
prominent cell surface constituents of transformed cells (see Section 2.5, infra) and 
alterations in their metabolism give rise to diseases of the nervous system. For example, 
several fatal hereditary diseases are caused by lysosomal storage of gangliosides wherein the 

30 absence or deficiency of lysosomal enzymes results in the deleterious accumulation of 
gangliosides. The most well known of these diseases is the neurodegenerative Tay-Sachs 
disease, which is characterized by the accumulation of ganglioside GM 2 . Accumulation of 
GM, results in GM, Gangliosidosis. 

35 2.4. REGULATION OF FUCOSYLTRANSFERASE EXPRESSION 
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'H substance', the fucosylated precursor of blood group determinants, is 
strictly regulated temporally and spatially during vertebrate development (Fenderson et al., 

1986, Dev. Biol. 114:12-21). 

Dramatic changes in the expression of cell surface glycolipids are found with 

5 oncogenesis (Hakomori, 1 989, Adv. Cancer Res. 52:257-33 1 ; Alhadeff, 1989, CRC Crit. 
Rev. Oncol./Hematol. 9:37-107). These changes frequently are oncofetal in nature in that a 
particular carbohydrate structure may be expressed during normal fetal development, 
disappear in adult tissues, and reappear in association with oncogenesis giving rise to a 
premalignant or malignant marker. One such example is expression of the ganglio-B 

1 0 determinant (II 3 NeuAcIV 3 ccGalIV 2 FucGg 4 ) during early stages of chemical carcinogenesis 
in rat liver with iV-2-acetylaminofluorene (AAF) (Holmes and Hakomori, 1982, J. Biol. 
Chem. 257:7698-7703; Scribner et al., 1983, Environ. Health Perspect. 49:81-89). 
Expression of this determinant has been shown to be a property of liver parenchymal cells 
resulting from a carcinogenic stimulus but not hepatotoxicity (Holmes, 1990, 

1 5 Carcinogenesis 1 1 : 89-94). This determinant has also been shown to be developmentally 
regulated in rat stomach (Bonhours et al., 1987, J. Biol. Chem. 258:3706-3713). Expression 
of this antigen is due to the activation of an a 1 -2fucosyltransferase which is normally 
unexpressed in adult rat liver parenchymal cells. This enzyme efficiently transfers fucose 
onto the terminal galactose residue of a GM, precursor, producing fucosyl-GMj 

20 (IV 3 NeuAcIV 2 FucGgOse 4 Cer). Fucosyl-GM, is a substrate for a constituitively expressed 
al -3galactosyltransferase forming the blood group B determinant on a ganglioside core 
chain (Holmes and Hakomori, 1983, J. Biol. Chem. 258:3706-3713; Holmes and Hakomori, 

1987, J. Biochem. 258:3706-3713). This al^3galactosyltransferase behaves as a blood 
25 group B transferase in that it efficiently catalyzes transfer of galactose in al ^3-linkage to 

terminal galactose residues of al—2fucosylated neolacto- and ganglio-series acceptors 
(Holmes and Hakomori, 1983, J. Biol. Chem. 258:3706-3713). 

High al - > 2fucosyltransferase expression is observed in rat hepatoma H35 

30 cells (Holmes and Hakomori, 1983, J. Biol. Chem. 258:3706-3713; Holmes and Hakomori, 
1987, J. Biochem. 258:3706-3713). The enzyme from H35 cells has specificity for a 
ganglio-series core chain. These cells accumulate large amounts of fucosyl-GM[ (Baumann, 
H., et al., 1979, Cancer Res. 39:2637-2643). Enzymological studies indicated this enzyme 
was inhibited by a wide variety of detergents, an unusual property for a membrane bound 

35 glycosyltransferase (Holmes, E.H., et al, 1983, J. Biol. Chem, 258:3706-3713). This 
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property may reflect a role for membrane phospholipids in maintaining the enzyme in an 
active conformation (Holmes and Hakomori, 1987, J. Biochem. 101:1095-1105). Later 
studies demonstrated that active enzyme could be solubilized from H35 cell membranes by 
0.4% CHAPSO which bound to the affinity resin GDP-hexanolamine-Sepharose (Holmes, 
5 E.H.,etaL, 1987, J. Biochem. 101:1095-1105). 

Further, the observation about the production by transformed cells of high 
levels of fucosyl-GM, as a result of al-2fucosyltransferase activity, is not restricted to rat 
hepatoma cells. For example, in humans, fucosyl-GM, is associated with small cell lung 
carcinoma (Fredman et al., 1986, Biochim. Biophys. Acta 875:316-323; Nilsson et al., 1984, 
10 Glycoconjugate J. 1:43-49). 

Generally, enzymatic oligosaccharide synthesis (including synthesis of 
glycolipids, glycoproteins, etc.) has been limited by the difficulty of isolation and 
enrichment of glycosyltransferases from natural sources. Thus, there is a need for methods 
to produce easily isolatable quantities of glycosyltranferases with high enzymatic activity. 
1 5 Such glycosyltransferases, produced, e.g. in vitro, would be useful reagents in 

compensating for the lack of natural resources. In particular, there is a need for methods to 
produce easily isolatable GM r specific al^2mcosy transferase. The ability to synthesize 
fucosyl-GM, in vitro is of particularly high value, as the ganglioside is important for the 
20 development of the mammalian nervous system. GM r specific al ^2fucosy transferase can 
be used to catalyze the addition of fucose residues to terminal Gaipi-3GalNAc saccharide 
chains of glycoproteins, glycolipids, glycolipoproteins and oligosaccharides, producing 
saccharide compositions that are useful nutritional additives or bases therefor. Further, 
fucosyl-GM, is envisaged to be an important tool in cancer therapy and cancer diagnostics. 
25 Until the cloning and characterization of the nucleic acid and amino acid sequences of the 
catalytic domain and the full length al^2fucosyltransferase of the present invention, no 
al-*2fucosyltransferases with GM, specificity had been identified. 

30 3. SUMMARY OF THE INVENTION 

The present invention provides a rat ganglioside GM,-specific 
a 1^2mcosy transferase. As indicated above, the novel nucleic acids of the invention 
encode an a l-2fucosy transferase enzyme specific for a terminal Galpl->3GalNAc 
35 saccharide found naturally in ganglioside GM,. According to the present invention, the 
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novel nucleic acids encode an al-2fucosyltransferase enzyme specific for the terminal 
Gaip l-3GalNAc moiety which can be a part of a glycoprotein, a glycolipid, a 
glycolipoprotein or free oligosaccharide or polysaccharide molecule. Merely for ease of 
description, and not limitation, the enzyme is referred to herein as "GM, -specific" or 
5 "ganglioside GM,-specific". More particularly, the invention encompasses nucleotide 

sequences of a rat ganglioside GM,-specific al^2fucosyltransferase, amino acid sequences 
of its encoded protein (including peptide or polypeptide), and derivatives and analogs 
thereof. The invention further encompasses fragments (and derivatives and thereof) which 
1 o comprise a domain of rat ganglioside GM,-specific a 1 - 2fucosyltransferase with catalytic 
activity. Methods of production of rat ganglioside GM, -specific al-2fucosyltransferase 
(e.g. by recombinant means), and derivatives and thereof, are provided. Methods of _ 
inhibiting the function of ganglioside GM,-specific al-2fucosyltransferase (e.g. by means 
j 5 of antisense RNA) are provided. The invention further encompasses methods for the use of 
rat ganglioside GM r specific al-2fucosyltransferase in the production of glycoproteins, 
glycolipids, glycolipoproteins and free oligo- or polysaccharides. Examples of uses of these 
products, such as uses as nutritional additives, are provided. The methods are particularly 
useful as they can be used in preparative biosynthesis of these saccharide-containing 
20 compositions, and are adaptable to such synthesis in large or commercial scale production. 
Of particular importance is the synthesis of fucosyl-GM„ which is useful as an 
immunotherapeutic against cancer and neurological disease. 

This invention provides an isolated or purified protein comprising an amino 
acid sequence as depicted in Figure 5 (SEQ ID NO:8). The invention further provides an 
25 isolated or purified protein comprising amino acids 28-380 of SEQ ID NO:8 as depicted in 
Figure 3A (SEQ ID NO: 10). 

This invention provides an isolated or purified protein consisting of an 
amino acid sequence as depicted in Figure 5 (SEQ ID NO:8). 

The invention further provides an isolated or purified protein consisting of 
30 amino acids sequence numbers 28-380 of SEQ ID NO:8 as depicted in Figure 3 A (SEQ ID 
NO: 10). 

This invention provides an isolated or purified protein, the amino acid 
sequence of which consists of a catalytic domain defined by amino acid numbers 1-353 as 
depicted in Figure 3 A (SEQ ID NO: 10) or amino acid numbers 28-380 as depicted in 
35 Figure 5 (SEQ ID NO:8). 
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This invention provides an isolated or purified protein, the amino acid 
sequence of which consists of amino acid numbers 1-380 as depicted in Figure 5 (SEQ ID 
NO:8) covalently linked to at least a portion of a second protein, which second protein is not 
said protein defined by the amino acid sequences as depicted in Figure 5 (SEQ ID NO:8). In 
5 another embodiment, the protein is fused by a covalent bond to at least a portion of a second 
protein, wherein said portion is the IgG binding domain of protein A. 

This invention provides an isolated or purified protein, the amino acid 
sequence of which consists of amino acids numbers 28-380 as depicted in Figure 5 (SEQ ID 
NO:8) or amino acids numbers 1-353 as depicted in Figure 3A (SEQ ID NO: 10) covalently 
1 0 linked to at least a portion of a second protein, which second protein is not said protein 
defined by the amino acid sequences as depicted in Figure 5(SEQ ID NO:8). In another 
embodiment, the protein is fused by a covalent bond to at least a portion of a second protein, 
wherein said portion is the IgG binding domain of protein A. 

This invention provides an isolated nucleic acid comprising a nucleotide 
1 5 sequence as depicted in Figure 5 (SEQ ID NO:7). 

This invention provides an isolated nucleic acid comprising a nucleotide 
sequence encoding an amino acid sequence as depicted in Figure 5 (SEQ ID NO:8). 

This invention provides an isolated nucleic acid comprising a nucleotide 
sequence as depicted in Figure 3 A (SEQ ID NO:9). 
20 This invention provides an isolated nucleic acid comprising a nucleotide 

sequence encoding an amino acid sequence as depicted in Figure 3 A (SEQ ID NO: 10). 

This invention provides an isolated RNA molecule comprising a nucleotide 
sequence as depicted in Figure 5 (SEQ ID NO:7), wherein the base U(uracil) is substituted 
for the base T (thymine) of said sequence. 
25 This invention provides an isolated RNA molecule comprising a nucleotide 

sequence encoding an amino acid sequence as depicted in Figure 5 (SEQ ID NO:8). 

This invention provides an isolated RNA molecule comprising a nucleotide 
sequence as depicted in Figure 3 A (SEQ ID NO:9), wherein the base U(uracil) is substituted 
for the base T (thymine) of said sequence. 
30 This invention provides an isolated RNA molecule comprising a nucleotide 

sequence encoding an amino acid sequence as depicted in Figure 3 A (SEQ ID NO: 10). 

This invention provides an isolated nucleic acid comprising a nucleotide 
sequence that is the reverse complement of a nucleotide sequence encoding an amino acid 
sequence as depicted in Figure 5 (SEQ ID NO: 8). 

35 
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This invention provides an isolated nucleic acid comprising a nucleotide 
sequence that is the reverse complement of a nucleotide sequence encoding an amino acid 
sequence as depicted in Figure 3A (SEQ ID NO: 10). 

This invention provides a vector comprising (a) a nucleotide sequence as 
5 depicted in Figure 5 (SEQ ID NO:7)and (b) an origin of replication. In one embodiment, 
the nucleotide sequence is operably linked to a heterologous promoter. 

This invention provides a vector comprising (a) a nucleotide sequence as 
depicted in Figure 3 A (SEQ ID NO:9)and (b) an origin of replication. In one embodiment, 
the nucleotide sequence is operably linked to a heterologous promoter. 
IQ This invention provides a vector comprising (a) a nucleotide sequence that is 

the reverse complement to all or a fragment of the nucleotide sequence as depicted in Figure 
5 (SEQ ID NO:7) and (b) an origin of replication. In one embodiment, the nucleotide 
sequence is operably linked to a heterologous promoter. 

This invention provides a vector comprising (a) a nucleotide sequence that is 
1 5 the reverse complement to all or a fragment of the nucleotide sequence as depicted in Figure 
3A (SEQ ID NO:9) and (b) an origin of replication. In one embodiment, the nucleotide 
sequence is operably linked to a heterologous promoter. 

The invention provides a vector comprising (a) a nucleotide sequence 
encoding an amino acid sequence as depicted in Figure 5 (SEQ ID NO:8) and (b) an origin 
20 of replication. 

The invention provides a vector comprising (a) a nucleotide sequence 
encoding an amino acid sequence as depicted in Figure 3A (SEQ ID NO: 10) and (b) an 
origin of replication. 

The invention provides a recombinant cell containing a recombinant nucleic 
25 acid vector comprising a nucleotide sequence as depicted in Figure 5 (SEQ ID NO:7). In 
one embodiment, the recombinant cell is a eukaryotic cell and preferably a mammalian cell. 

The invention provides a recombinant cell containing a recombinant nucleic 
acid vector comprising a nucleotide sequence as depicted in Figure 3A (SEQ ID NO:9). In 
one embodiment, the recombinant cell is a prokaryotic cell and preferably a bacterial cell. 
3 0 This invention provides a method of producing a rat a 1 - 2f\icosy transferase 

protein comprising: (a) culturing a recombinant cell containing a vector comprising a 
recombinant nucleotide sequence as depicted in Figure 5 (SEQ ID NO:7), such that the 
cd -2fucosy transferase protein, encoded by SEQ ID NO:7, is expressed by the cell; and (b) 
recovering the expressed protein or a cellular fraction containing said protein. In one 
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embodiment, the invention provides the purified protein produced by the method. In 
another embodiment, the invention provides a cellular fraction with said protein activity. 

This invention provides a method of producing a rat a l-2fucosy transferase 
protein comprising: (a) culturing a recombinant cell containing a vector comprising a 

5 recombinant nucleotide sequence as depicted in Figure 3A (SEQ ID NO:9), such that the 
a l-2fucosy transferase protein, encoded by SEQ ID NO:9, is expressed by the cell; and (b) 
recovering the expressed protein or a cellular fraction containing said protein. In one 
embodiment, the invention provides the purified protein produced by the method. In 

1 o another embodiment, the invention provides a cellular fraction with a 1 - 2fucosyltransferase 
protein activity. 

This invention provides a method of producing a rat al-2fucosyltransferase 
protein comprising: (a) culturing a recombinant cell containing a vector comprising a 
recombinant nucleotide sequence encoding a protein sequence as depicted in Figure 5 (SEQ 
ID NO:8), such that the al ^2fucosy transferase protein, encoded by SEQ ID NO:7, is 
expressed by the cell; and (b) recovering the expressed protein or a cellular fraction 
containing said protein. In one embodiment, the invention provides the purified protein 
produced by the method. In another embodiment, the invention provides a cellular fraction 

20 with a 1 — 2fucosyl transferase protein activity . 

This invention provides a method of producing a rat al -'2fucosyltransferase 
protein comprising: (a) culturing a recombinant cell containing a vector comprising a 
recombinant nucleotide sequence encoding a protein sequence as depicted in Figure 3 A 

25 (SEQ ID NO: 10), such that the al -2fucosy transferase protein, encoded by SEQ ID NO:9, 
is expressed by the cell; and (b) recovering the expressed protein or a cellular fraction 
containing said protein. In one embodiment, the invention provides the purified protein 
produced by the method. In another embodiment, the invention provides a cellular fraction 
with a l—2fucosy transferase protein activity. 

3 0 This invention provides a method for detecting the onset of liver cancer 

comprising the detection of the expression of a nucleotide sequence as depicted in Figure 5 
(SEQ ID NO:9) or a fragment or complement thereof. 

This invention provides a method to suppress or inhibit from a cell the 
function of the protein of the invention, which method comprises contacting said cell with a 

35 nucleic acid comprising a nucleotide sequence that is the reverse complement of a 
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nucleotide sequence as depicted in Figure 5 (SEQ ID NO:7) or a fragment thereof, or as 
depicted in Figure 3A (SEQ ID NO:9) or a fragment thereof, and wherein when said nucleic 
acid is RNA, the base T (thymine) in SEQ ID NO:7 and SEQ ID NO:9 is substituted by the 
base U (uracil). In one embodiment, said nucleic is contained within an adenoviral or 
5 retroviral vector. In another embodiment, the cell is a human small cell lung carcinoma cell. 
The invention provides methods for the preparative synthesis of 
compositions comprising Fucal-2Galpl-3GalNAc, said methods comprising contacting 
isolated or purified rat cd-2fucosyltransferase or a cellular fraction containing 
10 a 1 — 2fucosy Itransferase with GDP-fucose and a molecule having a terminal 

Gaipi-3GalNAc moiety. The molecule having a terminal Galpl-3GalNAc moiety can be 
a glycolipid, a glycoprotein, a glycolipoprotein or a free saccharide. 

Thus, the invention provides methods for the preparative synthesis of 
glycolipids, glycoproteins, glycolipoproteins or free oligosaccharides comprising 
1 5 Fuccel -2Galp 1 - 3GalNAc. In one embodiment, the fucosyl-glycolipid, -glycoprotein, - 
glycolipoprotein or -free oligosaccharide or -polysaccharide produced by the method of the 
invention is used as an additive to a nutritional formula. 

In a particular embodiment, the invention provides a method for the 
preparative synthesis of fucosyl-GM, comprising contacting isolated or purified rat 

20 

a 1 - 2fucosyltransferase or a cellular fraction containing a 1 - 2fucosyltransferase with GDP- 
fucose and the ganglioside GM, and recovering fucosyl-GM,. 

The invention provides methods for the use of fucosyl-GM! in 
immunotherapy for human disease comprising administering said compound to a human 
25 patient with a disease. In one embodiment, the disease is cancer or neurological disease. In 
a specific preferred embodiment, said patient has small cell lung carcinoma. 

3.1. ABBREVIATIONS 

As used herein, the following abbreviations shall have the meanings 

30 indicated. 

AAF: AT-2-acetylaminofluorine 
cd - 2FucT : a 1 - 2fucosyltransferase 
cDNA: complementary DNA 
3 5 FucT, fucosy transferase 
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fucosyl-GM, :II 3 NeuAcIV 3 FucGg4,Fuca 1 - 2Gaip 1 - 3 
GalNAcP 1 -4[NeuAca2- 3]Galp 1 -4GlcP 1 - 1 Cer 
ganglio-B: II 3 NeuAcIV 3 aGalIV 2 FucGg 4 , Galal-3[Fucal-2] 
5 Gaip 1 - 3GalNAcP 1 -4[NeuAca2-3]Gaip 1 -4GlcP 1 - 1 Cer 

GM,:II 3 NeuAcGg 4 ,Gaip 1 -3GalNAcP 1 - 4[NeuAca2-3]Galpl - 4 
Glcpl-lCer 

nLc 4 : lacto«eotetraosylceramide or 
10 Gaip 1 -4GlcNAcP 1 -3Gaipi -4GlcP 1 - 1 Cer 

PCR: polymerase chain reaction 

RT-PCR: reverse transcription - polymerase chain reaction 

4. BRIEF DESCRIPTION OF THE FIGURES 

Figure 1 . Portions of aligned nucleotide sequences of human (SEQ ID 
NO.'s: 12-20) and rabbit (SEQ ID NO.'s:21-29) al~»2FucT nucleic acids. The regions 
corresponding to forward and reverse primers used in the Example described infra in 
Section 6 are indicated except for Primer III (SEQ ID NO:3) which corresponds to the most 
20 3' end of the open reading frame. 

Figure 2. RT-PCR analysis of rat hepatoma H35 cell total RNA. Lane 1, RT- 
PCR product generated using primers I (SEQ ID NO: 1) and II (SEQ ID NO:2); lane 2, RT- 
PCR product generated using primers I (SEQ ID NO:l) and III (SEQ ID NO:3); lane 3, RT- 
25 PCR product generated using primers V (SEQ ID NO:5) and III (SEQ ID NO:3). Seven ul 
of each PCR mix was electrophoresed in a 0.8% agarose gel in IX TBE buffer. The gel was 
stained with ethidium bromide. Size standards of 1.0, 0.75, and 0.5 kb are indicated. 

Figure 3(A-B). Nucleotide (SEQ ID NO:9) and deduced amino acid sequence (SEQ 
30 id NO: 1 0) of the catalytic domain of rat hepatoma H35 cell cd -2FucT. Figure 3 A. 

Nucleotide and deduced amino acid sequence of the 1068-bp rat hepatoma H35 cell a 1-2 

FucT RT-PCR product generated with primers V (SEQ ID NO:5) and III (SEQ ID NO:3). 

The sequence extends from the second C residue following the EcoEI site in primer V 
35 through the end of primer III (SEQ ID NO:3). This nucleotide sequence has been deposited 

in GenBank with the Accession No. AF042743. The sequence is translated in reading frame 



- 11 - 



NY2- 950266. 1 



1. Potential N-linked glycosylation sites are shaded. The region which overlaps rat FTB is 
indicated by a solid line over the sequence. The amino acid differing between the H35 cell 
sequence and that predicted by the rat FTB sequence is underlined. The stop codon is 
indicated in bold lettering. Figure 3B. Comparison of amino acid sequence homology 
5 between the catalytic domain of rat hepatoma H3 5 cell a 1 - 2FucT and human Sec2 (SEQ 
ID NO:ll). 

Figure 4. TLC analysis of reaction products from transfer of [ 14 C]fucose to 
GM, and nLc 4 catalyzed by the pPROTA-expressed catalytic domain of rat hepatoma H35 

10 cell cd -2FucT. Lanes 1 and 3 show results from pPROTA expressed H35 cell al-2FucT 
in the forward orientation. Lanes 2 and 4 show results from pPROTA expressed H35 cell 
oel-2FucT in the reverse orientation. Lanes 1 and 2, transfer to GM,; lanes 3 and 4, 
transfer to nLc 4 . The arrow indicates the TLC mobility of standard fucosyl-GM,. The 

1 5 solvent system was composed of CHCl 3 :CH 3 OH:H 2 0 (60:40:9), containing 0.02% 
CaCl 2 2H 2 0. See, infra, Section 6 for details. 

Figure 5. Nucleotide (SEQ ID NO:7) and deduced amino acid sequence 
(SEQ ID NO:8) of the 1 140 bp rat hepatoma H35 cell cd-2FucT RT-PCR product 

20 generated with primers VI (SEQ ID NO:6) and III (SEQ ID NO:3). The entire coding 
region of 380 amino acids through the stop codon is represented. Potential N-linked 
glycosylation sites are highlighted. The region which was found to overlap rat FTB is 
indicated by a solid line over the sequence. The amino acid differing between the H35 
sequence and that predicted by the rat FTB is underlined. The intra-cellular/transmembrane 

25 domain comprised of 81 nucleotides (27 amino acids), is shown in larger italic font. 

Figure 6. TLC analysis of reaction products from transfer of [ 14 C] to GM, 
catalyzed by expressed recombinant full length rat hepatoma ccl-2FucT. Lane A: transfer 
to GM, in absence of detergent or phospholipid; Lane B: transfer to GM, in the presence of 
30 phosphatidylglycerol (PPG), Lane C: transfer to GM, in the presence of PPG and G3634A 
detergent, and Lane D: transfer to GM, in the presence of CHAPSO detergent. The 
reactions were conducted for two hours at 37°C. GM, standard is indicated. The solvent 
system was composed of CHC1 3 :CH 3 0H:H 2 0 (60:40:9), containing 0.02% CaCl 2 2H 2 0. 
See, infra, Section 7 for details. 
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Figure 7, PCR products generated using primers I (SEQ ID NOT) and II 
(SEQ ID NO:2) in RT-PCR analysis total RNA from rat hepatoma H35 cells and from 
normal rat liver tissue. RT-PCR analysis was performed on Lane 1: Total RNA from rat 
hepatoma H35 cells, Lane 2: Total RNA from normal rat liver tissue, and Lane 3: Total 
5 RNA from AAF-fed rat liver tissue. The arrow on right indicates location of 0.6-kb PCR 
product. Size markers (in kb) are indicated on left. Five ul of each PCR mix was 
electrophoresed in a 0.8% agarose gel in lx TBE buffer. The gel was stained with ethidium 
bromide. See, infra, Section 8 for details. 

1 o Figure 8(A-B). Figure 8 A. TLC Analysis of reaction products from transfer of [ ,4 C] 

to GM, catalyzed by full length expressed recombinant cd-2FucT from COS-7 cells 
transfected with FL-RFT-pcDNA3 in the presence of increasing concentrations of antisense 
FL-RFT(-)-pcDNA3. All reactions were carried out in the presence of CHAPSO detergent. 
Equimolar ratios of total DNA were maintained in each transfection by including Varying 

15 concentrations of pcDNA3 plasmid (vector minus insert). All lanes except Lane II were 
transfected with 1 ug of FL-RFT-pcDN A3 . Total FL-RFT(-)-pcDNA3 transfected was as 
follows: Lane I - 0 ug, Lane II - 1 .0 ug, Lane III - 1 .0 \ig, Lane IV - 2.0 ug, Lane V - 3.0 ug 
, and Lane VI - 5.0 ug. The solvent system was composed of CHC1 3 :CH 3 0H:H 2 0 (60:40:9), 
containing 0.02% CaCl 2 2H 2 0. The GM, standard was visualized by spraying in 0.5% 

20 orcinol in 2 N sulfuric acid; Figure 8B. Percentage reduction of initial a l~2FucT activity 
by increasing doses of FL-RFT(-)-pcDNA3. The major reaction product in each lane 
(indicated by arrow) (see Figure 8A) was scraped off the plate and counted in a scintillation 
counter. Cpm minus background counts of 1 17 (Lane II) and percentage reduction of initial 
cd-2FucT activity by increasing doses of FL-RFT(-)-pcDNA3 are shown. See, infra, 

25 Section 9 for details. 

Figure 9. Preparative in vitro biosynthesis of fucosyl-GM, utilizing 
recombinant rat cd-2fucosyltransferase. The results demonstrate the appearance of 
increasing amounts of a slower migrating band corresponding to fucosyl-GM, from transfer 
30 of fucose in the al-2-linkage to the added GM, acceptor with time. The enzyme is very 
active, yielding almost complete conversion to fucosyl-GM, after 24 to 48 hours. See, infra, 
Section 10 for details. 

5. DETAILED DESCRIPTION OF THE INVENTION 

35 
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As described herein, the inventors have discovered and characterized a new 
ganglioside GM, -specific al-*2fucosyltransferase gene, representing the first instance in 
which a nucleotide sequence encoding a fucosyltransferase with GM, -specificity has been 
identified. The novel nucleotide sequence and novel encoded protein constitute very useful 

5 tools for the preparative synthesis of fucosyl-containing glycolipids, glycoproteins, 
glycolipoproteins and oligosaccharides. In a particular embodiment, the nucleotide 
sequences and encoded proteins are useful for the preparative synthesis of fucosyl-GM,. 

The present invention thus encompasses proteins encoded by and nucleotide 
sequences of a rat, GM,-specific al-*2fucosyltransferase gene. The invention further 
encompasses derivatives and analogs of such al-*2fucosyltransferase protein. Nucleic 
acids encoding such derivatives or analogs are also within the scope of the invention. 
Production of the foregoing proteins, e.g., by recombinant methods, is provided. 

The invention also encompasses al-2fucosyltransferase protein derivatives 

1 5 and analogs which are functionally active, i.e. , which are capable of displaying catalytic 
activity associated with a full-length GM,-specific al^2fucosyltransferase protein. 
Catalytic activity is defined as the ability to mediate the synthesis of fucosyl-GM, from 
starting materials consisting of the ganglioside GM, and the sugar nucleotide donor GDP- 

20 fucose. 

For clarity of disclosure, and not by way of limitation, the detailed 
description of the invention is divided into the following subsections which describe or 
illustrate certain features, embodiments or applications of the invention. 



5.1. ISOLATION OF RAT a 1 - 2FUCOS YLTRANSFERASE NUCLEIC 

ACIDS 

The invention relates to the nucleotide sequences of a rat GM,-specific 
al-2fucosyltransferase (hereinafter al-2FucT). The invention provides isolated or 
purified nucleic acids comprising an al-2FucT encoding sequence; in another embodiment, 
the nucleic acids comprise the 1069 nucleotide catalytic region of an al-2FucT sequence. 
Nucleic acids can be single or double stranded. The invention also relates to nucleic acids 
hybridizable to or complementary to the foregoing sequences or their reverse complements. 
In specific aspects, nucleic acids are provided which comprise a sequence complementary tc 
at least the 1069 nucleotide catalytic of an al-2FucT gene domain, or the entire coding 
region. 
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5.1.1. HYBRIDIZATION CONDITIONS 

In a specific embodiment, a nucleic acid which is hybridizable to an 
al -2FucT nucleic acid (e.g., having a sequence as set forth in SEQ ID NO:7, or to its 
reverse complement, or to a nucleic acid encoding an al-2FucT derivative or analog, or to 

5 its reverse complement), under conditions of low stringency is provided. By way of 
example and not limitation, procedures using such conditions of low stringency are as 
follows (see also Shilo and Weinberg, 1981, Proc. Natl. Acad. Sci. U.S.A. 78, 6789-6792). 
Filters containing DNA are pretreated for 6 h at 40° C in a solution containing 35% 
formamide, 5X SSC, 50 mM Tris-HCi (pH 7.5), 5 mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% 

10 BSA, and 500 ug/ml denatured salmon sperm DNA. Hybridizations are carried out in the 
same solution with the following modifications: 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 
ug/ml salmon sperm DNA, 10% (wt/vol) dextran sulfate, and 5-20 X 10 6 cpm 32 P-labeled 
probe is used. Filters are incubated in hybridization mixture for 18-20 h at 40 °C, and then 
washed for 1.5 h at 55 °C in a solution containing 2X SSC, 25 mM Tris-HCl (pH 7.4), 

1 5 5 mM EDTA, and 0.1% SDS. The wash solution is replaced with fresh solution and 
incubated an additional 1 .5 h at 60 °C. Filters are blotted dry and exposed for 
autoradiography. If necessary, filters are washed for a third time at 65-68 °C and re-exposed 
to film. Other conditions of low stringency which may be used are well known in the art 
(e.g., as employed for cross-species hybridizations). 

20 In another specific embodiment, a nucleic acid which is hybridizable to an 

al-2FucT nucleic acid, or its reverse complement, under conditions of high stringency is 
provided. By way of example and not limitation, procedures using such conditions of high 
stringency are as follows. Prehybridization of filters containing DNA is carried out for 8 h 
to overnight at 65 °C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM 

25 EDTA, 0.02% PVP, 0.02% Ficoll, 0.02% BSA, and 500 ug/ml denatured salmon sperm 
DNA. Filters are hybridized for 48 h at 65° C in prehybridization mixture containing 100 
ug/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. Washing of 
filters is done at 37°C for 1 h in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, 
and 0.01% BSA. This is followed by a wash in 0.1X SSC at 50°C for 45 min before 

30 autoradiography. Other conditions of high stringency which may be used are well known in 
the art. 

In another specific embodiment, a nucleic acid which is hybridizable to an 
<xl-2FucT nucleic acid, or its reverse complement, under conditions of moderate stringency 
is provided. Selection of appropriate conditions for such stringencies is well known in the 
35 art (see e.g., Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Cold 
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Spring Harbor Laboratory Press, Cold Spring Harbor, New York; see also, Ausubel et al., 
eds., in the Current Protocols in Molecular Biology series of laboratory technique manuals, 
© 1987-1997, Current Protocols, © 1994-1997 John Wiley and Sons, Inc.). 

Nucleic acids encoding derivatives and analogs of ccl-2FucT proteins, and 
5 a 1 -2FucT antisense nucleic acids are additionally provided. As is readily apparent, as used 
herein, a "nucleic acid encoding a fragment or portion of an al^2FucT protein" shall be 
construed as referring to a nucleic acid encoding only the recited fragment or portion of the 
ccl-2FucT protein and not the other contiguous portions of the al-2FucT protein as a 
continuous sequence. 

10 In a preferred specific embodiment, after hybridization, wash conditions are 

as follows. Each membrane is washed two times each for 30 minutes each at 45 °C in 40 
mM sodium phosphate, pH 7,2, 5% SDS, 1 mM EDTA, 0.5% bovine serum albumin, 
followed by four washes each for 30 minutes in sodium phosphate, pH 7.2, 1% SDS, 1 mM 
EDTA, and subsequently each membrane is treated differently as described below 'for low, 

1 5 medium, or high stringency hybridization conditions. For low stringency hybridization, 
membranes are not washed further. For medium stringency hybridization, membranes are 
additionally subjected to four washes each for 30 minutes in 40 mM sodium phosphate, pH 
7.2, 1% SDS, 1 mM EDTA at 55°C. For high stringency hybridization, following the 
washes for low stringency, membranes are additionally subjected to four washes each for 30 

20 minutes in 40 mM sodium phosphate, pH 7.2, 1% SDS, 1 mM EDTA at 55 °C, followed by 
four washes each for 30 minutes in sodium phosphate, pH 7.2, 1% SDS, 1 mM EDTA at 
65°C. 

5.1.2. CLONING PROCEDURES 

25 Specific embodiments for the cloning of a 1 -2FucT nucleic acids follow. 

For expression cloning (a technique well known in the art), an expression library is 
constructed by any method known in the art. For example, mRNA is isolated, cDNA is 
made and ligated into an expression vector {e.g., a bacteriophage derivative) such that it is 
capable of being expressed by the host cell into which it is then introduced. Various 

30 screening assays can then be used to select for the expressed ccl-*2FucT product. In one 
embodiment, anti-cd-2FucT antibodies can be used for selection. 

In another embodiment, polymerase chain reaction (PCR) is used to amplify 
the desired sequence in a genomic or cDNA library, prior to selection. Oligonucleotide 
primers representing known al-2FucT sequences can be used as primers in PCR. In a 

35 preferred aspect, the oligonucleotide primers represent at least part of conserved segments 
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of strong homology between cd -2FucT genes of different species. Examples of useful 
primers are provided (the al-2FucT coding regions and complements thereof in SEQ ID 
NOs:l-6). The synthetic oligonucleotides may be utilized as primers to amplify sequences 
from a source (RNA or DNA), preferably a cDNA library, of potential interest. PCR can be 

5 carried out, e.g. , by use of a Perkin-Elmer Cetus thermal cycler and Taq polymerase (e.g. , 
Gene Amp™). The nucleic acid being amplified can include mRNA or cDNA or genomic 
DNA from any species. One may synthesize degenerate primers for amplifying homologs 
from other species in the PCR reactions. 

It is also possible to vary the stringency of hybridization conditions used in 

10 priming the PCR reactions, to allow for greater or lesser degrees of nucleotide sequence 
similarity between the known al-2FucT nucleotide sequences and a nucleic acid homolog 
(or ortholog) being isolated. For cross species hybridization, low stringency conditions are 
preferred. For same species hybridization, moderately stringent conditions are preferred. 
After successful amplification of a segment of an al-2FucT homolog, that segment may be 

15 cloned and sequenced by standard techniques, and utilized as a probe to isolate a complete 
cDNA or genomic clone. This, in turn, permits the determination of the gene's complete 
nucleotide sequence, the analysis of its expression, and the production of its protein product 
for functional analysis, as described below. In this fashion, additional nucleic acids 
encoding cd-2FucT proteins may be identified. 

20 The above-described methods are not meant to limit the following general 

description of methods by which clones of ccl~2FucT genes may be obtained. 

Any eukaryotic cell potentially can serve as the nucleic acid source for 
molecular cloning of al-2FucT nucleic acids. The nucleic acid sequences encoding 
cd-2FucT proteins may be isolated from vertebrate, mammalian, human, porcine, bovine, 

25 feline, avian, equine, canine, as well as additional primate sources, insects (e.g., 

Drosophila), invertebrates, plants, etc. The DNA may be obtained by standard procedures 
known in the art from cloned DNA (e.g., a DNA "library"), by chemical synthesis, by cDNA 
cloning, or by the cloning of genomic DNA, or fragments thereof, purified from the desired 
cell (see e.g., Sambrook et al., 1989, Molecular Cloning, A Laboratory Manual, 2d Ed., Vol. 

30 I, II, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York; Glover, ed., 
1985, DNA Cloning: A Practical Approach, MRL Press, Ltd., Oxford, U.K.). Clones 
derived from genomic DNA may contain regulatory and intron DNA regions in addition to 
coding regions; clones derived from cDNA will contain only exon sequences. Whatever the 
source, the nucleic acid should be molecularly cloned into a suitable vector for propagation 

35 of the nucleic acid sequence. 
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In the molecular cloning of the gene from genomic DNA, DNA fragments 
are generated, some of which will encode the desired gene. The DNA may be cleaved at 
specific sites using various restriction enzymes. Alternatively, one may use DNase in the 
presence of manganese to fragment the DNA, or the DNA can be physically sheared, as for 
5 example, by sonication. The linear DNA fragments can then be separated according to size 
by standard techniques, including but not limited to, agarose and polyacrylamide gel 
electrophoresis and column chromatography. 

Once the DNA fragments are generated, identification of the specific DNA 
fragment containing the desired nucleic acid may be accomplished in a number of ways. 
1 0 For example, if a portion of an a 1 -2FucT gene or its specific RNA or a fragment thereof is 
available and can be purified and labeled, the generated DNA fragments may be screened by 
nucleic acid hybridization to the labeled probe (Benton and Davis, 1977, Science 196:180; 
Grunstein and Hogness, 1975, Proc. Natl. Acad. Sci. U.S.A. 72:3961). Those DNA " 
fragments with substantial homology to the probe will hybridize. It is also possible to 
1 5 identify the appropriate fragment by restriction enzyme digestion(s) and comparison of 
fragment sizes with those expected according to a known restriction map if such is 
available. Further selection can be carried out on the basis of the properties of the gene. 

Alternatively, the presence of the desired nucleic acid may be detected by 
assays based on the physical, chemical, or immunological properties of its expressed 
20 product. For example, cDNA clones, or DNA clones which hybrid-select the proper 
mRNAs, can be selected and expressed to produce a protein that has, e.g., similar or 
identical electrophoretic migration, isoelectric focusing behavior, proteolytic digestion 
maps, catalytic activity, or antigenic properties as known for an cd-2FucT protein. Using 
an antibody to a known al-2FucT protein, other al-2FucT proteins may be identified by 
25 binding of the labeled antibody to expressed putative cd -2FucT proteins, e.g., in an ELISA 
(enzyme-linked immunosorbent assay)-type procedure. Further, using a binding protein 
specific to a known cd-2FucT protein, other cd-2FucT proteins may be identified by 
binding to such a protein (see e.g., Clemmons, 1993, Mol. Reprod. Dev. 35:368-374; 
Loddick et al., 1998, Proc. Natl. Acad. Sci. U.S.A. 95:1894-1898). 
30 An cd-2FucT nucleic acid can also be identified by mRNA selection using 

nucleic acid hybridization followed by in vitro translation. In this procedure, fragments are 
used to isolate complementary mRNAs by hybridization. Such DNA fragments may 
represent available, purified al~2FucT DNA of another species (e.g., mouse, human). 
Immunoprecipitation analysis or functional assays (e.g., catalytic activity, etc.) of the in 
35 vitro translation products of the isolated products of the isolated mRNAs identifies the 
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mRNA and, therefore, the complementary DNA fragments that contain the desired 
sequences. In addition, specific mRNAs may be selected by adsorption of polysomes 
isolated from cells to immobilized antibodies specifically directed against <xl-*2FucT 
protein. A radiolabeled al-*2FucT cDNA can be synthesized using the selected mRNA 
5 (from the adsorbed polysomes) as a template. The radiolabeled mRNA or cDNA may then 
be used as a probe to identify the al-2FucT DNA fragments from among other genomic 
DNA fragments. 

Alternatives to isolating the cd-2FucT genomic DNA include, but are not 
limited to, chemically synthesizing the nucleic acid sequence itself from a known sequence 

1 0 or making cDN A to the mRNA which encodes the a 1 - 2FucT protein. For example, RN A 
for cDNA cloning of the al-*2FucT gene can be isolated from cells which express the gene. 

The identified and isolated nucleic acid can then be inserted into an 
appropriate cloning vector. A large number of vector-host systems known in the art may be 
used. Possible vectors include, but are not limited to, plasmids or modified viruses, but the 

1 5 vector system must be compatible with the host cell used. Such vectors include, but are not 
limited to, bacteriophages such as lambda derivatives, or plasmids such as PBR322 or pUC 
plasmid derivatives or the Bluescript vector (Stratagene USA, La Jolla, California). The 
insertion into a cloning vector can, for example, be accomplished by ligating the DNA 
fragment into a cloning vector which has complementary cohesive termini. However, if the 

20 complementary restriction sites used to fragment the DNA are not present in the cloning 
vector, the ends of the DNA molecules may be enzymatically modified. Alternatively, any 
site desired may be produced by ligating nucleotide sequences (linkers) onto the DNA 
termini; these ligated linkers may comprise specific chemically synthesized oligonucleotides 
encoding restriction endonuclease recognition sequences. In an alternative method, the 

25 cleaved vector and an od-2FucT nucleic acid may be modified by homopolymeric tailing. 
Recombinant molecules can be introduced into host cells via transformation, transfection, 
infection, electroporation, etc., so that many copies of the nucleic acid sequence are 
generated. 

In an alternative method, the desired nucleic acid may be identified and 
30 isolated after insertion into a suitable cloning vector in a "shot gun" approach. Enrichment 
for the desired nucleic acid, for example, by size fractionization, can be done before 
insertion into the cloning vector. 

In an additional embodiment, the desired nucleic acid may be identified and 
isolated after insertion into a suitable cloning vector using a strategy that combines a "shot 
35 gun" approach with a "directed sequencing" approach. Here, for example, the entire DNA 
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sequence of a specific region of the genome, such as a sequence tagged site (STS), can be 
obtained using clones that molecularly map in and around the region of interest. 

In specific embodiments, transformation of host cells with recombinant DNA 
molecules that incorporate an isolated cd-2FucT gene, cDNA, or synthesized DNA 
5 sequence enables generation of multiple copies of the gene. Thus, the nucleic acid may be 
obtained in large quantities by growing transformants, isolating the recombinant DNA 
molecules from the transformants and, when necessary, retrieving the inserted nucleic acid 
from the isolated recombinant DNA. 

The al-2FucT sequences provided by the instant invention include those 
10 nucleotide sequences encoding substantially the same amino acid sequences as found in 
native cd-2FucT proteins, and those encoded amino acid sequences with functionally 
equivalent amino acids, as well as those encoding other al-2FucT derivatives or analogs, 
as described in below for cd-2FucT derivatives and analogs. 

1 5 5.2. EXPRESSION OF A RAT al-2FUCOSYLTRANSFERASE CODING 
SEQUENCE 

The nucleotide sequence coding for an ccl-2FucT protein or a functionally 
active analog or other derivative thereof (see Section 5.6), can be inserted into an 
appropriate expression vector, i.e., a vector which contains the necessary elements for the 

20 transcription and translation of the inserted protein-coding sequence. The necessary 

transcriptional and translational signals can also be supplied by the native ccl-2FucT gene 
and/or its flanking regions. A variety of host-vector systems may be utilized to express the 
protein-coding sequence. These include but are not limited to mammalian cell systems 
infected with virus {e.g., vaccinia virus, adenovirus, etc.); insect cell systems infected with 

25 virus {e.g. , baculovirus); microorganisms such as yeast containing yeast vectors, or bacteria 
transformed with bacteriophage, DNA, plasmid DNA, or cosmid DNA. The expression 
elements of vectors vary in their strengths and specificities. Depending on the host- vector 
system utilized, any one of a number of suitable transcription and translation elements may 
be used. In yet another embodiment, a fragment of an al-2FucT protein comprising one or 

30 more domains of the al-2FucT protein is expressed. 

Any of the methods previously described for the insertion of DNA fragments 
into a vector may be used to construct expression vectors containing a chimeric nucleic acid 
consisting of appropriate transcriptional/translational control signals and the protein coding 
sequences. These methods may include in vitro recombinant DNA and synthetic techniques 

35 and in vivo recombinants (genetic recombination). Expression of a nucleic acid sequence 
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encoding an cd-2FucT protein or peptide fragment may be regulated by a second nucleic 
acid sequence so that the al-2FucT protein or peptide is expressed in a host transformed 
with the recombinant DNA molecule. For example, expression of an od-2FucT protein 
may be controlled by any promoter/enhancer element known in the art. A 

5 promoter/enhancer may be homologous (i.e. native) or herterologous (i.e. not native). 
Promoters which may be used to control the expression of al-2FucT coding sequences 
include, but are not limited to, the SV40 early promoter region (Benoist and Chambon, 
1981, Nature 290:304-310), the promoter contained in the 3' long terminal repeat of Rous 
sarcoma virus (Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine kinase 

10 promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the regulatory 
sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42), prokaryotic 
expression vectors such as the P-lactamase promoter (Villa-Kamaroff et al., 1978, Proc. 
Natl. Acad. Sci. U.S.A. 75:3727-3731), or the lac promoter (DeBoer et al., 1983, Proc. Natl. 
Acad. Sci. U.S.A. 80:21-25; Scientific American, 1980, 242:74-94), plant expression 

1 5 vectors comprising the nopaline synthetase promoter region (Herrera-Estrella et al., Nature 
303:209-213), the cauliflower mosaic virus 35S RNA promoter (Gardner et al., 1981, Nucl. 
Acids Res. 9:2871), and the promoter of the photosynthetic enzyme ribulose biphosphate 
carboxylase (Herrera-Estrella et al, 1984, Nature 310:1 15-120), promoter elements from 
yeast or other fungi such as the Gal4-responsive promoter, the ADC (alcohol 

20 dehydrogenase) promoter, PGK (phosphoglycerol kinase) promoter, alkaline phosphatase 
promoter, and the following animal transcriptional control regions, which exhibit tissue 
specificity and have been utilized in transgenic animals: elastase I gene control region which 
is active in pancreatic acinar cells (Swift et al., 1984, Cell 38:639-646; Ornitz et al., 1986, 
Cold Spring Harbor Symp. Quant. Biol. 50:399-409; MacDonald, 1987, Hepatology 7:425- 

25 5 1 5); a gene control region which is active in pancreatic beta cells (Hanahan, 1985, Nature 
315:1 15-122), an immunoglobulin gene control region which is active in lymphoid ceils 
(Grosschedl et al., 1984, Cell 38:647-658; Adames et al., 1985, Nature 318:533-538; 
Alexander et al., 1987, Mol. Cell. Biol. 7:1436-1444), mouse mammary tumor virus control 
region which is active in testicular, breast, lymphoid and mast cells (Leder et al, 1986, Cell 

30 45:485-495), albumin gene control region which is active in liver (Pinkert et al., 1987, 

Genes and Devel. 1:268-276), alpha-fetoprotein gene control region which is active in liver 
(Krumlauf et al., 1985, Mol. Cell. Biol. 5:1639-1648; Hammer et al., 1987, Science 235:53- 
58), alpha 1-antitrypsin gene control region which is active in the liver (Kelsey et al., 1987, 
Genes and Devel. 1:161-171), beta-globin gene control region which is active in myeloid 
35 cells (Mogram et al., 1985, Nature 315:338-340; Kollias et al., 1986, Cell 46:89-94), myelin 
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basic protein gene control region which is active in oligodendrocyte cells in the brain 
(Readhead et al., 1987, Cell 48:703-712); myosin light chain-2 gene control region which is 
active in skeletal muscle (Sani, 1985, Nature 314:283-286), and gonadotropic releasing 
hormone gene control region which is active in the hypothalamus (Mason et al., 1986, 
5 Science 234:1372-1378). 

In a specific embodiment, a vector is used that comprises a promoter 
operably linked to an al-*2FucT nucleic acid, one or more origins of replication, and, 
optionally, one or more selectable markers (e.g., an antibiotic resistance gene). 

In a specific embodiment, the promoter that is operably linked to the rat 
10 al-2FucT nucleic acid is not the native rat al-2FucT gene promoter (i.e. it is a 
heterologous promoter). 

In a specific embodiment, an expression construct is made by subcloning an 
al-2FucT coding sequence into the EcoBl restriction site of the pPROTA mammalian cell 
expression vector (Henion et al., 1994, Glycobiology 4:193-202). This allows for the 
1 5 expression of the al -2FucT protein product from the subclone fused to the IgG binding 
domain of protein A. 

In another specific embodiment, an expression construct is made by 
subcloning an al-2FucT coding sequence into the pcDNA3 expression vector (Invitrogen 
Corp., Carlsbad, California). This allows for high level expression of the cd-2FucT protein 
20 product from the subclone. 

In another specific embodiment, an expression construct is made by 
subcloning an al-2FucT coding sequence into the pichia pPIC9 expression vector 
(Invitrogen Corp., Carlsbad, California). This allows for high level expression of the 
al-2FucT protein product from the subclone. 
25 Expression vectors containing a 1 -2FucT coding sequence inserts can be 

identified by four general approaches: (a) nucleic acid hybridization; (b) molecular biology, 
(c) expression of inserted sequences; and (d) presence or absence of "marker" gene 
functions . In the first approach, the presence of an al-2FucT nucleic acid inserted in an 
expression vector can be detected by nucleic acid hybridization using probes comprising 
30 sequences that are homologous to an inserted al -2FucT nucleic acid. In the second 
approach, a combination of molecular biology and "marker" gene function are used to 
identify recombinant expression vectors containing the al-2FucT insert. For example, if 
the al-2FucT nucleic acid is inserted in the EcoKL site of the pcDNA3 vector, which codes 
for both Ampicillin and Neomycin resistance, bacterial cells that take up the vector are 
35 identified by their resistance to Ampicillin and/or Neomycin, and those vectors containing 
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the al-*2FucT insert can be identified by restriction digestion of the amplified vector DNA 
with EcoKL. In the third approach, recombinant expression vectors can be identified by 
assaying the al-2FucT product expressed by the recombinant. Such assays can be based, 
for example, on the physical or functional properties of the al-2FucT protein in in vitro 
5 assay systems, e.g., the catalysis of fucosyl-GM, synthesis. In the fourth approach, the 
vector/host system can be identified based upon the presence or absence of certain "marker" 
gene functions (e.g., thymidine kinase activity, {3-galactosidase, resistance to antibiotics, 
transformation phenotype, occlusion body formation in baculovirus, etc.) caused by the 
insertion of an al-2FucT nucleic acid in the vector. For example, if the cd~2FucT nucleic 

10 acid is inserted within the marker gene sequence of the vector, recombinants containing the 
cd-2FucT insert can be identified by the absence of the marker gene function. 

Once a particular recombinant DNA molecule is identified and isolated, 
several methods known in the art may be used to propagate it. Once a suitable host system 
and growth conditions are established, recombinant expression vectors can be propagated 

1 5 and prepared in quantity. As previously explained, the expression vectors which can be 
used include, but are not limited to, the following vectors or their derivatives: human or 
animal viruses such as vaccinia virus or adenovirus; insect viruses such as baculovirus; 
yeast vectors; bacteriophage vectors (e.g., lambda phage), and plasmid and cosmid DNA 
vectors, to name but a few. 

20 In addition, a host cell strain may be chosen which modulates the expression 

of the inserted sequences, or modifies and processes the nucleic acid product in the specific 
fashion desired. Expression from certain promoters can be elevated in the presence of 
certain inducers; thus, expression of the genetically engineered al-2FucT protein may be 
controlled. Furthermore, different host cells have characteristic and specific mechanisms 

25 for the translational and post-translational processing and modification (e.g. , glycosylation) 
of proteins. Appropriate cell lines or host systems can be chosen to ensure the desired 
modification and processing of the foreign protein expressed. For example, expression in a 
bacterial system can be used to produce a soluble al~2FucT catalytic domain. Expression 
in animal cells can be used to ensure folding, proper membrane insertion and glycosylation 

30 ofal-2FucT. 

In other specific embodiments, the al-2FucT protein, derivative or analog 
may be expressed as a fusion, or chimeric protein product (comprising the protein, 
derivative or analog joined via a covalent bond such a peptide bond to a heterologous 
protein sequence (of a different protein)). A chimeric protein may include fusion of the 
35 al -2FucT protein, derivative or analog to a second protein or at least a portion thereof, 
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wherein a portion is one (preferably 10, 15, or 20) or more amino acids of said second 
protein. The second protein, or one or more amino acid portion thereof, may be from a 
different rat <xl-2FucT protein or may be from a protein that is not a rat al-2FucT protein. 
Such a chimeric product can be made by ligating the appropriate nucleic acid sequences 

5 encoding the desired amino acid sequences to each other by methods known in the art, in the 
proper coding frame, and expressing the chimeric product by methods commonly known in 
the art. Alternatively, such a chimeric product may be made by protein synthetic 
techniques, e.g., by use of a peptide synthesizer. In a specific embodiment, the amino acid 
portion of the second protein is one that allows for the extracellular secretion of the 

10 al-2FucT catalytic domain, e.g. the Ig binding domain of protein A (Henion et al., 1994, 
Glycobiology 4:193-202). In a specific embodiment, the amino acid portion of the second 
protein is one that allows for the membrane localization of the ccl-2FucT catalytic domain, 
e.g. the type I transmembrane domain of sevenless (Basler et al., 1991) or Notch (reviewed 
by Weinmaster, 1997, Mol. Cell. Neurosci. 9:91-102), the type II transmembrane domain of 

15 human H-type al-2fucosyltransferase (Koda et al., 1997, Eur. J. Biochem. 300:623-626), 
or the myristylation signal of src proteins (Cross et al., 1984, Mol. Cell Biol. 4:1834-1842; 
Simon et al., 1985, Cell 42:831-840). 



5.3. IDENTIFICATION AND PURIFICATION OF RAT <xl-2FucT 
PRODUCTS . 

20 

In particular aspects, the invention provides amino acid sequences of 
ccl-2FucT proteins and derivatives or analogs thereof which comprise an antigenic 
determinant (i.e., can be recognized by an antibody) or which are otherwise functionally 
active, as well as nucleic acid sequences encoding the foregoing. "Functionally active" 

25 al -2FucT material as used herein refers to that material displaying one or more functional 
activities associated with a full-length (wild-type) cd-2FucT protein, e.g., enzymatic ability 
to transfer fucose or a fucosyl moiety in an a 1-2 linkage to a terminal galactose of a 
Gaipi^3GalNAc moiety, e.g. GM„ with specificity, etc. 

3Q Once a recombinant nucleic acid which expresses the al -2FucT coding 

sequence is identified, the product can be analyzed. This is achieved by assays based on the 
physical or functional properties of the product, including radioactive labeling of the 
product followed by analysis by gel electrophoresis, immunoassay, etc. 

Once the al-2FucT protein is identified, it may be isolated and purified by 

35 standard methods including chromatography (e.g. , ion exchange, affinity, and sizing column 
chromatography), centrifugation, differential solubility, or by any other standard technique 
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for the purification of proteins. The functional properties may be evaluated using any 
suitable assay (see, e.g., Section 5.7). 

Alternatively, once an al-2FucT protein produced by a recombinant is 
identified, the amino acid sequence of the protein can be deduced from the nucleotide 
5 sequence of the chimeric nucleic acid. As a result, the protein can be synthesized by 
standard chemical methods known in the art (e.g., see Hunkapiller et aL, 1984, Nature 
310:105-111). 

In another alternate embodiment, native cd-2FucT proteins can be purified 
from natural sources, by standard methods such as those described above (e.g., 
1 0 immunoaffinity purification). 

In a specific embodiment of the present invention, such <xl-2FucT proteins, 
whether produced by recombinant DNA techniques or by chemical synthetic methods or by 
purification of native proteins, include but are not limited to those containing, as a primary 
amino acid sequence, all or part of the amino acid sequence substantially as depicted in 
15 Figure 5 (SEQ ID NO: 8), as well as derivatives and analogs thereof, including proteins 
homologous thereto. 



5.4. STRUCTURE OF «1->2FucT NUCLEIC AC IDS AND PROTEINS 

20 The structure of <xl -2FucT nucleic acids and proteins of the invention can be 

analyzed by various methods known in the art. Some examples of such methods are 
described below. 

5.4.1. GENETIC ANALYSIS 

25 The cloned DNA or cDNA corresponding to an cd -2FucT nucleic acid can 

be analyzed by methods including but not limited to Southern hybridization (Southern, 
1975, J. Mol. Biol. 98:503-517), Northern hybridization (see e.g., Freeman et al., 1983, 
Proc. Natl. Acad. Sci. U.S.A. 80:4094-4098), restriction endonuclease mapping (Maniatis, 
1982, Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Laboratory Press, 

30 Cold Spring Harbor, New York), and DNA sequence analysis. Accordingly, this invention 
provides nucleic acid probes recognizing an ocl-2FucT nucleic acid. For example, 
polymerase chain reaction (PCR; U.S. Patent Nos. 4,683,202, 4,683,195 and 4,889,818; 
Gyllenstein et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 85:7652-7656; Ochman et al., 1988, 
Genetics 120:621-623; Loh et al., 1989, Science 243:217-220) followed by Southern 

35 hybridization with an al-2FucT-specific probe can allow the detection of an cd-2FucT 
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gene in DNA from various cell types. Methods of amplification other than PCR are 
commonly known and can also be employed. In one embodiment, Southern hybridization 
can be used to determine the genetic linkage of an <xl-2FucT gene. Northern hybridization 
analysis can be used to determine the expression of an cd-2FucT gene. Various cell types, 
5 at various states of development or activity can be tested for a 1 -2FucT gene expression. 
The stringency of the hybridization conditions for both Southern and Northern hybridization 
can be manipulated to ensure detection of nucleic acids with the desired degree of 
relatedness to the specific <xl-2FucT-probe used. Modifications of these methods and other 
methods commonly known in the art can be used. 
I o Restriction endonuclease mapping can be used to roughly determine the 

genetic structure of an cd-2FucT nucleic acid. Restriction maps derived by restriction 
endonuclease cleavage can be confirmed by DNA sequence analysis. 

DNA sequence analysis can be performed by any techniques known in the 
art, including but not limited to the method of Maxam and Gilbert (1980, Meth. Enzymol. 
1 5 65:499-560), the Sanger dideoxy method (Sanger et al., 1977, Proc. Natl. Acad. Sci. U.S.A. 
74:5463), the use of T7 DNA polymerase (Tabor and Richardson, U.S. Patent No. 
4,795,699), or use of an automated DNA sequenator (e.g. , Applied Biosystems, Foster City, 
California). 

20 5.4.2. PROTEIN ANALYSIS 

The amino acid sequence of an od-2FucT protein can be derived by 
deduction from the DNA sequence, or alternatively, by direct sequencing of the protein, e.g. , 
with an automated amino acid sequencer. 

An cd-2FucT protein sequence can be further characterized by a 
25 hydrophilicity analysis (Hopp and Woods, 1981, Proc. Natl. Acad. Sci. U.S.A. 78:3824). A 
hydrophilicity profile can be used to identify the hydrophobic and hydrophilic regions of the 
cd-2FucT protein and the corresponding regions of the gene sequence which encode such 
regions. 

Structural prediction analysis (Chou and Fasman, 1974, Biochemistry 
30 13 :222) can also be done, to identify regions of an a 1 -2FucT protein that assume specific 
secondary structures. 

Manipulation, translation, and secondary structure prediction, open reading 
frame prediction and plotting, as well as determination of sequence homologies, can also be 
accomplished using computer software programs available in the art. 

35 
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Other methods of structural analysis can also be employed. These include 
but are not limited to X-ray crystallography (Engstom, 1974, Biochem. Exp. Biol. 1 1:7-13), 
nuclear magnetic resonance spectroscopy (Clore and Gonenborn, 1989, CRC Crit. Rev. 
Biochem. 24:479-564) and computer modeling (Fletterick and Zoller, 1986, Computer 
5 Graphics and Molecular Modeling, in Current Communications in Molecular Biology, Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, New York). 

5.5. ANTIBODIES 

According to the invention, al-2FucT protein, its derivatives, or analogs 
10 thereof, may be used as an immunogen to generate antibodies which immunospecifically 
bind such an immunogen. Such antibodies include but are not limited to polyclonal, 
monoclonal, chimeric, single chain, Fab fragments, and an Fab expression library. In 
another embodiment, antibodies to a domain (e.g., an al-2FucT receptor binding domain) 
of an cd-2FucT protein are produced. In a specific embodiment, fragments of an ' 
15 a 1 -2FucT protein identified as hydrophilic are used as immunogens for antibody 
production. 

Various procedures known in the art may be used for the production of 
polyclonal antibodies to an <xl-2FucT protein or derivative or analog. In a particular 
embodiment, rabbit polyclonal antibodies to an epitope of an al-2FucT protein consisting 

20 of the sequence of SEQ ID NO:2, or a subsequence thereof, can be obtained. For the 

production of antibody, various host animals can be immunized by injection with the native 
al-2FucT protein, or a synthetic version, or derivative thereof, including but not limited to 
rabbits, mice, rats, etc. Various adjuvants may be used to increase the immunological 
response, depending on the host species, and including but not limited to Freund's (complete 

25 and incomplete), mineral gels such as aluminum hydroxide, surface active substances such 
as lysolecithin, pluronic polyols, polyanions, peptides, oil emulsions, keyhole limpet 
hemocyanins, dinitrophenol, and potentially useful human adjuvants such as BCG (bacille 
Calmette-Guerin) and corynebacterium parvum. 

For preparation of monoclonal antibodies directed to an ocl~2FucT protein 

30 sequence or analog thereof, any technique which provides for the production of antibody 
molecules by continuous cell lines in culture may be used. For example, the hybridoma 
technique originally developed by Kohler and Milstein, (Kohler and Milstein 1975, Nature 
256:495-497), as well as the trioma technique, the human B-cell hybridoma technique 
(Kozbor et al., 1983, Immunology Today 4:72), and the EBV-hybridoma technique to 

35 produce human monoclonal antibodies (Cole et al., 1985, in Monoclonal Antibodies and 
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Cancer Therapy, Alan R. Liss, Inc., pp. 77-96). In an additional embodiment of the 
invention, monoclonal antibodies can be produced in germ-free animals utilizing recent 
technology {see e.g., PCT/US90/02545). According to the invention, human antibodies may 
be used and can be obtained by using human hybridomas (Cole et al., 1983, Proc. Natl. 
5 Acad. Sci. U.S.A. 80:2026-2030) or by transforming human B cells with EBV virus in vitro 
(Cole et al, 1985, in Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, pp. 77-96). 
In fact, according to the invention, techniques developed for the production of "chimeric 
antibodies" (Morrison et al, 1984, Proc. Natl. Acad. Sci. U.S.A. 81:6851-6855; Neuberger 
et al., 1984, Nature 3 12:604-608; Takeda et al., 1985, Nature 3 14:452-454) by splicing the 
10 genes from a mouse antibody molecule specific for an al-2FucT protein together with 
genes from a human antibody molecule of appropriate biological activity can be used; such 
antibodies are within the scope of this invention. 

According to the invention, techniques described for the production of single 
chain antibodies (U.S. Patent No. 4,946,778) can be adapted to produce al-2FucT-specific 
15 single chain antibodies. An additional embodiment of the invention utilizes the techniques 
described for the construction of Fab' expression libraries (Huse et al., 1989, Science 
246:1275-1281) to allow rapid and easy identification of monoclonal Fab fragments with 
the desired specificity for al-2FucT proteins, derivatives, or analogs. 

Antibody fragments which contain the idiotype of the molecule can be 
20 generated by known techniques. For example, such fragments include but are not limited to, 
the F(ab') 2 fragment which can be produced by pepsin digestion of the antibody molecule, 
the Fab' fragments which can be generated by reducing the disulfide bridges of the F(ab') 2 
fragment, the Fab fragments which can be generated by treating the antibody molecule with 
papain and a reducing agent, and Fv fragments. 
25 In the production of antibodies, screening for the desired antibody can be 

accomplished by techniques known in the art {e.g., enzyme-linked immunosorbent assay or 
ELISA). For example, to select antibodies which recognize a specific domain of a 
al -2FucT protein, one may assay generated hybridomas for a product which binds to a 
al-2FucT fragment containing such domain. For selection of an antibody that specifically 
30 binds a first cd-2FucT homolog but which does not specifically bind a different cd-2FucT 
homolog, one can select on the basis of positive binding to the first ccl-2FucT homolog and 
a lack of binding to the second al-2FucT homolog. 

Antibodies specific to a domain of an al-2FucT protein are also provided. 
Antibodies specific to an epitope of an cd-2FucT protein are also provided. 

35 
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The foregoing antibodies can be used in methods known in the art relating to 
the localization and activity of the cd-2FucT protein sequences of the invention, e.g., for 
imaging these proteins, measuring levels thereof in appropriate physiological samples, in 
diagnostic methods, etc. 

5 

5.6. al-2FucT PROTEINS AND DERIVATIVES 

The invention further encompasses al-2FucT proteins, derivatives, analogs, 
and molecules of cd-2FucT proteins. As used herein, a molecule defined by a particular 
SEQ ID NO, shall be construed to mean that the sequence of that molecule comprises that 

10 SEQ ID NO, unless explicitly indicated otherwise to mean that the sequence of the molecule 
consists of that SEQ ID NO. Nucleic acids encoding al-2FucT protein derivatives and 
protein analogs are also provided. In one embodiment, the al-2FucT proteins are encoded 
by the al-2FucT nucleic acids described in Section 5.1 above. In particular aspects, the 
proteins, derivatives, or analogs are of ccl-2FucT proteins encoded by the amino "acid 

15 sequence of (SEQ ID NO:8). 

The production and use of derivatives and analogs related to an al-2FucT 
protein are within the scope of the present invention. In a specific embodiment, the 
derivative or analog is functionally active, i.e., capable of exhibiting one or more functional 
activities associated with a full-length, wild-type cd-2FucT protein. As one example, such 

20 derivatives or analogs which have the desired immunogenicity or antigenicity can be used in 
immunoassays, for immunization, for inhibition of al-2FucT activity, etc. As another 
example, such derivatives or analogs which have the desired binding activity can be used for 
binding to the InR gene product. As yet another example, such derivatives or analogs which 
have the desired binding activity can be used for binding to a binding protein specific for a 

25 known al-2FucT protein {see e.g., Clemmons, 1993, Mol. Reprod. Dev. 35:368-374; 
Loddick et al., 1998, Proc. Natl. Acad. Sci. U.S.A. 95:1894-1898). Derivatives or analogs 
that retain, or alternatively lack or inhibit, a desired <xl-2FucT protein property-of-interest 
{e.g., binding to an cd-2FucT protein binding partner), can be used as inducers, or 
inhibitors, respectively, of such property and its physiological correlates. A specific 

30 embodiment relates to an a 1 -2FucT protein fragment that can be bound by an anti- 

al-2FucT protein antibody. Derivatives or analogs of an ccl-2FucT protein can be tested 
for the desired activity by procedures known in the art, including but not limited to the 
assays described in Section (5.10 and 5.1 1 below) 

In particular, al-2FucT derivatives can be made by altering od-2FucT 

35 sequences by substitutions, additions {e.g., insertions) or deletions that provide for 
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functionally equivalent molecules. Due to the degeneracy of nucleotide coding sequences, 
other DNA sequences which encode substantially the same amino acid sequence as an 
cd-2FucT nucleic acid may be used in the practice of the present invention. These include 
but are not limited to nucleotide sequences comprising all or portions of an al-2FucT 

5 nucleic acid which is altered by the substitution of different codons that encode a 

functionally equivalent amino acid residue within the sequence, thus producing a silent 
change. Likewise, the ccl-2FucT derivatives of the invention include, but are not limited to, 
those containing, as a primary amino acid sequence, all or part of the amino acid sequence 
of an al-2FucT protein including altered sequences in which functionally equivalent amino 

10 acid residues are substituted for residues within the sequence resulting in a silent change. 
For example, one or more amino acid residues within the sequence can be substituted by 
another amino acid of a similar polarity which acts as a functional equivalent, resulting in a 
silent alteration. Substitutions for an amino acid within the sequence may be selected from 
other members of the class to which the amino acid belongs. For example, the nonpolar 

15 (hydrophobic) amino acids include alanine, leucine, isoleucine, valine, proline, 

phenylalanine, tryptophan and methionine. The polar neutral amino acids include glycine, 
serine, threonine, cysteine, tyrosine, asparagine, and glutamine. The positively charged 
(basic) amino acids include arginine, lysine and histidine. The negatively charged (acidic) 
amino acids include aspartic acid and glutamic acid. Such substitutions are generally 

20 understood to be conservative substitutions. 

In a specific embodiment of the invention, proteins consisting of or 
comprising a fragment of an cd-2FucT protein consisting of at least 10 (continuous) amino 
acids of the al-2FucT protein are provided. In other embodiments, the fragment consists 
of at least 20 or at least 50 amino acids of the cd-2FucT protein. In specific embodiments, 

25 such fragments are not larger than 35, 100 or 200 amino acids. Derivatives or analogs of 
al-2FucT proteins include but are not limited to those molecules comprising regions that 
are substantially homologous to an ccl-2FucT protein or fragment thereof (e.g., in various 
embodiments, at least 60% or 70% or 80% or 90% or 95% identity over an amino acid 
sequence of identical size or when compared to an aligned sequence in which the alignment 

30 is done by a computer homology program known in the art) or whose encoding nucleic acid 
is capable of hybridizing to a coding al~2FucT gene sequence, under high stringency, 
moderate stringency, or low stringency conditions. 

Specifically, by way of example computer programs for determining 
homology may include but are not limited to TBLASTN, BLASTP, FASTA, TFASTA, and 

35 CLUSTALW (Altschul et al., 1990, J. Mol. Biol. 215(3):403-10; see, Pearson and Lipman, 
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1988, Proc. Natl. Acad. Sci. USA 85(8):2444-8; Thompson, et al., 1994, Nucleic Acids Res. 
22(22):4673-80; Higgins, et al., 1996, Methods Enzymol 266:383-402). 

Basic Local Alignment Search Tool (BLAST) (www.ncbi.nlm.nih.gov) 
(Altschul et al., 1990, J. of Molec. Biol., 215:403-410, "The BLAST Algorithm"; Altschul 
5 et al., 1997, Nuc. Acids Res. 25:3389-3402) is a heuristic search algorithm tailored to 
searching for sequence similarity which ascribes significance using the statistical methods 
of Karlin and Altschul 1990, Proc. Nat'l Acad. Sci. USA, 87:2264-68; 1993, Proc. Nat'l 
Acad. Sci. USA 90:5873-77. Five specific BLAST programs perform the following tasks: 
1) The BLASTP program compares an amino acid query sequence against a protein 
1 0 sequence database; 2) The BLASTN program compares a nucleotide query sequence against 
a nucleotide sequence database; 3) The BLASTX program compares the six-frame 
conceptual translation products of a nucleotide query sequence (both strands) against a 
protein sequence database; 4) The TBLASTN program compares a protein query sequence 
against a nucleotide sequence database translated in all six reading frames (both strands); 5) 
1 5 The TBLASTX program compares the six-frame translations of a nucleotide query sequence 
against the six-frame translations of a nucleotide sequence database. 

Smith- Waterman (database: European Bioinformatics Institute 
wwwz.ebi.ac.uk/bic_sw/) (Smith-Waterman, 1981, J. of Molec. Biol., 147:195-197) is a 
mathematically rigorous algorithm for sequence alignments. 
20 FASTA (see Pearson et al., 1988, Proc. Nat'l Acad. Sci. USA, 85:2444-2448) 

is a heuristic approximation to the Smith- Waterman algorithm. 

For a general discussion of the procedure and benefits of the BLAST, 
Smith- Waterman and FASTA algorithms see Nicholas et al., 1998, "A Tutorial on 
Searching Sequence Databases and Sequence Scoring Methods" (www.psc.edu) and 
25 references cited therein. 

The <xl-2FucT derivatives and analogs of the invention can be produced by 
various methods known in the art. The manipulations which result in their production can 
occur at the gene or protein level. For example, a cloned al-2FucT nucleic acid sequence 
can be modified by any of numerous strategies known in the art (Sambrook et al., 1989, 
30 Molecular Cloning, A Laboratory Manual, 2d ed., Cold Spring Harbor Laboratory Press, 
Cold Spring Harbor, New York). The sequence can be cleaved at appropriate sites with 
restriction endonuclease(s), followed by further enzymatic modification if desired, isolated, 
and ligated in vitro. In the production of a modified nucleic acid encoding a derivative or 
analog of an al-2FucT protein, care should be taken to ensure that the modified nucleic 
35 acid remains within the same translational reading frame as the native protein, uninterrupted 
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by translational stop signals, in the gene region where the desired al-2FucT protein activity 
is encoded. 

Additionally, an al-2FucT nucleic acid sequence can be mutated in vitro or 
in vivo, to create and/or destroy translation, initiation, and/or termination sequences, or to 
5 create variations in coding regions and/or to form new restriction endonuclease sites or 
destroy preexisting ones, to facilitate further in vitro modification. Any technique for 
mutagenesis known in the art can be used, including but not limited to, chemical 
mutagenesis, in vitro site-directed mutagenesis (Hutchinson et al, 1978, J. Biol. Chem. 
253:6551), use of TAB® linkers (Pharmacia), PCR with primers containing a mutation, etc. 
1 o Manipulations of an al -2FucT protein sequence may also be made at the 

p protein level. Included within the scope of the invention are al -2FucT protein fragments 

or other derivatives or analogs which are differentially modified during or after translation, 
O e.g. , by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 

protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule'or other 
yj 15 cellular ligand, etc. Any of numerous chemical modifications may be carried out by known 

r\ techniques, including but not limited to specific chemical cleavage by cyanogen bromide, 

U trypsin, chymotrypsin, papain, V8 protease, NaBH 4 , acetylation, formylation, oxidation, 

reduction, metabolic synthesis in the presence of tunicamycin, etc. 
O In addition, analogs and derivatives of an al-2FucT protein can be 

20 chemically synthesized. For example, a peptide corresponding to a portion of an a 1 -2FucT 
protein which comprises the desired domain, or which mediates the desired activity in vitro, 
can be synthesized by use of a peptide synthesizer. Furthermore, if desired, nonclassical 
amino acids or chemical amino acid analogs can be introduced as a substitution or addition 
into the al-2FucT sequence. Non-classical amino acids include but are not limited to the 
25 D-isomers of the common amino acids, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 

2- amino butyric acid, y-Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 

3- amino propionic acid, ornithine, norleucine, norvaline, hydroxyproline, sarcosine, 
citrulline, cysteic acid, t-butylglycine, t-butylalanine, phenylglycine, cyclohexylalanine, p- 
alanine, fluoro-amino acids, designer amino acids such as P-methyl amino acids, Ca-methyl 

30 amino acids, Na-methyl amino acids, and amino acid analogs in general. Furthermore, the 
amino acid can be D (dextrorotary) or L (levorotary). 

In a specific embodiment, an <xl-2FucT protein derivative is a chimeric or 
fusion protein comprising an al-2FucT protein or fragment thereof (preferably consisting 
of at least a domain or motif of the al-2FucT protein, or at least 10 amino acids of the 

35 a 1 -2FucT protein) joined at its amino- or carboxy-terminus via a peptide bond to an amino 
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acid sequence of a different protein. In specific embodiments, the amino acid sequence of 
the different protein is at least 6, 10, 20 or 30 continuous amino acids of the different 
proteins or a portion of the different protein that is functionally active. In one embodiment, 
such a chimeric protein is produced by recombinant expression of a nucleic acid encoding 

5 the protein (comprising an ccl -2FucT-coding sequence joined in-frame to a coding 
sequence for a different protein). Such a chimeric product can be made by ligating the 
appropriate nucleic acid sequences encoding the desired amino acid sequences to each other 
by methods known in the art, in the proper coding frame, and expressing the chimeric 
product by methods commonly known in the art. Alternatively, such a chimeric product 

1 0 may be made by protein synthetic techniques, e.g. , by use of a peptide synthesizer. 
Chimeric genes comprising the whole al-2FucT open reading frame or the nucleotides 
encoding the catalytic domain fused to any heterologous protein-encoding sequences may be 
constructed. 

In another specific embodiment, the al-2FucT derivative is a molecule 
1 5 comprising a region of homology with the full length or catalytic domain of cd -2FucT 
protein. By way of example, in various embodiments, a first protein region can be 
considered "homologous" to a second protein region when the amino acid sequence of the 
first region is at least 30%, 40%, 50%, 60%, 70%, 75%, 80%, 90%, or 95% identical, when 
compared to any sequence in the second region of an equal number of amino acids as the 
20 number contained in the first region or when compared to an aligned sequence of the second 
region that has been aligned by a computer homology program known in the art. For 
example, a molecule can comprise one or more regions homologous to an al-2FucT 
catalytic domain (see Section 5.6.1) or a portion thereof. 

25 5.7. ELIMINATION OF al-2FucT ACTIVITY 

The present invention provides for methods of creating cells lacking 
al-*2fucosyltransferase activity. 

In one embodiment, loss-of-function phenotypes are generated by antisense 
RNA methods (Schubiger and Edgar, 1994, Methods in Cell Biology 44:697-713). One 
30 form of the antisense RNA method involves the injection of cells with an antisense RNA 
that is partially homologous to the gene-of-interest (in this case an al-2FucT nucleic acid). 
Another form of the antisense RNA method involves expression of an antisense RNA 
partially homologous to the gene-of-interest by operably joining a portion of the gene-of- 
interest in the antisense orientation to a powerful promoter that can drive the expression of 
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large quantities of antisense RNA, either generally throughout the animal or in specific 
tissues. 

In a second embodiment, ioss-of-function phenotypes are generated by 
cosuppression methods (Bingham, 1997, Cell 90(3):385-7; Smyth, 1997, Curr. Biol. 
5 7(12):793-5; Que and Jorgensen, 1998, Dev. Genet. 22(l):100-9). Cosuppression is a 
phenomenon of reduced gene expression produced by expression or injection of a sense 
strand RNA corresponding to a partial segment of the gene-of-interest. Cosuppression 
effects have been employed extensively in plants to generate loss-of-function phenotypes. 

10 5.7.1. ANTISENSE REGULATION OF GENE E XPRESSION 

The invention provides for antisense uses of rat ccl-2FucT nucleic acids. In a 
specific embodiment, an ocl-2FucT protein function is inhibited by use of al-2FucT 
antisense nucleic acids. The present invention provides for use of nucleic acids of at least 
six nucleotides that are antisense to a gene or cDNA encoding an cd-2FucT protein or a 

1 5 portion thereof. An ccl -2FucT "antisense" nucleic acid as used herein refers to a nucleic 
acid capable of hybridizing to a sequence-specific (i.e. non-poly A) portion of an al-2FucT 
RNA (preferably mRNA) by virtue of some sequence complementarity. Antisense nucleic 
acids may also be referred to as inverse complement nucleic acids. The antisense nucleic 
acid may be complementary to a coding and/or noncoding region of an al-2FucT mRNA. 

20 Such antisense nucleic acids have utility in inhibiting an al -2FucT protein function. 

The antisense nucleic acids of the invention can be oligonucleotides that are 
double-stranded or single-stranded, RNA or DNA or a modification or derivative thereof, 
which can be directly administered to a cell. The al-2FucT antisense nucleic acids of the 
invention are preferably oligonucleotides (ranging from 6 to about 50 oligonucleotides). In 

25 specific aspects, an oligonucleotide is at least 10 nucleotides, at least 15 nucleotides, at least 
100 nucleotides, or at least 200 nucleotides in length. The oligonucleotide can be DNA or 
RNA or chimeric mixtures or derivatives or modified versions thereof, or single-stranded or 
double-stranded. The oligonucleotide can be modified at the base moiety, sugar moiety, or 
phosphate backbone. The oligonucleotide may include other appending groups such as 

30 peptides, or agents facilitating transport across the cell membrane (see e.g., Letsinger et al., 
1989, Proc. Natl. Acad. Sci. U.S.A. 86:6553-6556; Lemaitre et al., 1987, Proc. Natl. Acad. 
Sci. U.S.A. 84:648-652; PCT Publication No. WO 88/09810, published December 15, 
1988) or the blood-brain barrier (see e.g., PCT Publication No. WO 89/10134, published 
April 25, 1988), hybridization-triggered cleavage agents (see e.g., Krol et al., 1988, 
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BioTechniques 6:958-976) or intercalating agents (see e.g., Zon, 1988, Pharm. Res. 5:539-549). 

In a preferred aspect of the invention, an al-2FucT antisense 
oligonucleotide is provided as single-stranded DNA. In another preferred aspect, such an 
oligonucleotide comprises a sequence antisense to the sequence encoding a B peptide 

5 domain or an A peptide domain of an a 1 -2FucT protein. The oligonucleotide may be 
modified at any position on its structure with substituents generally known in the art. 

The al-2FucT antisense oligonucleotide may comprise at least one modified 
base moiety which is selected from the group including but not limited to 5-fluorouracil, 
5-bromouracil, 5-chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 

10 5-(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thio uridine, 

5-carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, 
N6-isopentenyladenine, 1 -methylguanine, 1 -methylinosine, 2,2-dimethylguanine, 
2-methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 
7-methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta- 

15 D-mannosylqueosine, 5'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio-N6- 
isopentenyladenine, uracil-5-oxy acetic acid (v), wybutoxosine, pseudouracil, queosine, 

2- thiocytosine, 5-methyl-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil- 
5-oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino- 

3 - N-2-carboxy propyl) uracil, (acp3)w, and 2,6-diaminopurine. In another embodiment, the 
20 oligonucleotide comprises at least one modified sugar moiety selected from the group 

including but not limited to arabinose, 2-fluoroarabinose, xylulose, and hexose. 

In yet another embodiment, the oligonucleotide comprises at least one 
modified phosphate backbone selected from the group consisting of a phosphorothioate, a 
phosphorodithioate, a phosphoramidothioate, a phosphoramidate, a phosphordiamidate, a 

25 methylphosphonate, an alkyl phosphotriester, and a formacetal or analog thereof. 

In yet another embodiment, the oligonucleotide is an a-anomeric 
oligonucleotide. An a-anomeric oligonucleotide forms specific double- stranded hybrids 
with complementary RNA in which, contrary to the usual P-units, the strands run parallel to 
each other (Gautier et al., 1987, Nucl. Acids Res. 15:6625-6641). The oligonucleotide may 

30 be conjugated to another molecule, e.g., a peptide, a hybridization-triggered cross-linking 
agent, a transport agent, a hybridization-triggered cleavage agent, etc. 

Oligonucleotides of the invention may be synthesized by standard methods 
known in the art, e.g., by use of an automated DNA synthesizer (such as are commercially 
available from Biosearch, Applied Biosystems, etc.). As examples, phosphorothioate 

35 oligonucleotides may be synthesized by the method of Stein et al. (Stein et al., 1988, Nucl. 



-35- 



NY2- 950266.1 



Acids Res. 16:3209), methylphosphonate oligonucleotides can be prepared by use of 
controlled pore glass polymer supports (Sarin et al., 1988, Proc. Natl. Acad. Sci. U.S.A. 
85:7448-7451), etc. 

In a specific embodiment, an cd-2FucT antisense oligonucleotide comprises 
5 catalytic RNA, or a ribozyme (see e.g. , PCT Publication WO 90/1 1 364, published October 
4, 1990; Sarver et al., 1990, Science 247:1222-1225). In another embodiment, the 
oligonucleotide is a 2'-0-methylribonucleotide (Inoue et al., 1987, Nucl. Acids Res. 
15:6131-6148), or a chimeric RNA-DNA analogue (Inoue et al., 1987, FEBS Lett. 215:327- 
330). 

10 In a preferred embodiment, the antisense nucleic acids of the invention are 

h expressed intracellular^ by transcription from an exogenous sequence. For example, a 

y vector can be introduced in vivo such that it is taken up by a cell, within which cell the 

O vector or a portion thereof is transcribed, producing an antisense nucleic acid (RNA) of the 

:l : invention. Such a vector would contain a sequence encoding the a 1 -2FucT antisense 

yj 1 5 nucleic acid. Such a vector can remain episomal or become chromosomally integrated, as 

long as it can be transcribed to produce the desired antisense RNA. The antisense nucleic 
!=& acid can be administered by use of an adenoviral or retroviral vector (see US 4,980,286), by 

direct injection, or by use of microparticle bombardment (e.g., a gene gun; Biolistic, 
Q Dupont), by coating with lipids or cell-surface receptors or transfecting agents, or by 

20 administering it in linkage to a homeobox-like peptide which is known to enter the nucleus 
(see e.g., Joliot et al., 1991, Proc. Natl. Acad. Sci. USA 88:1864-1868), etc.. Vectors can be 
constructed by recombinant DNA technology methods standard in the art. Vectors can be 
plasmid, viral, or others known in the art, used for replication and expression in mammalian 
cells. Expression of the sequence encoding the <xl~2FucT antisense RNA can be by any 
25 promoter known in the art. Such promoters can be inducible or constitutive. Such 
promoters include but are not limited to: the SV40 early promoter region (Benoist and 
Chambon, 1981, Nature 290:304-3 10), the promoter contained in the 3' long terminal repeat 
of Rous sarcoma virus (Yamamoto et al., 1980, Cell 22:787-797), the herpes thymidine 
kinase promoter (Wagner et al., 1981, Proc. Natl. Acad. Sci. U.S.A. 78:1441-1445), the 
30 regulatory sequences of the metallothionein gene (Brinster et al., 1982, Nature 296:39-42), 
etc. 

The antisense nucleic acids of the invention comprise a sequence 
complementary to at least a sequence-specific portion of an RNA transcript of an <xl-2FucT 
gene. However, absolute complementarity, although preferred, is not required. A sequence 
35 "complementary to at least a portion of an RNA," as referred to herein, means a sequence 
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having sufficient complementarity to be able to hybridize with the RNA, forming a stable 
duplex; in the case of double-stranded al-2FucT antisense nucleic acids, a single strand of 
the duplex DNA may thus be tested, or triplex formation may be assayed. The ability to 
hybridize will depend on both the degree of complementarity and the length of the antisense 
5 nucleic acid. Generally, the longer the hybridizing nucleic acid, the more base mismatches 
with an cd-2FucT RNA it may contain and still form a stable duplex (or triplex, as the case 
may be). One skilled in the art can ascertain a tolerable degree of mismatch by use of 
standard procedures to determine, e.g., the melting point of the hybridized complex. 

10 5.8. BIOCHEMICAL ASSAYS USING <xl-2FucT PROTEINS 

The functional activity of al~2FucT proteins or derivatives can be assayed 
by various methods known to one skilled in the art. 

For example, as illustrated in Section 6.3.2, infra, the activity of 
al - > 2fucosyltransferase coupled to the IgG-binding domain of Protein A can be determined 

1 5 in reaction mixtures containing 2.5 umol of HEPES buffer, pH7.2, 30 ug of GM, 

ganglioside or nLc 4 , 250 ug phosphatidylglycerol, 1 umol of MnCl 2 , 0.5 umol of CDP- 
choline, 15 nmol of GDP-[ l4 C]fucose (15,000 cpm/nmol), and bound to IgG-agarose beads 
in a total volume of 0.1 ml. The reaction mixtures are incubated for 2 h at 37°C, terminated 
by the addition of 0.1 ml of CHC1 3 :CH 3 0H (2:1), streaked onto 4-cm-wide strips of 

20 Whatman 3 paper and developed with water overnight. The papers are dried and the labeled 
product extracted from the origins with 2- to 5-ml washes of CHC1 3 :CH 3 0H:H 2 0 (10:5:1). 
The combined eluates are concentrated to dryness by an N 2 stream and dissolved in 20 ul of 
CHCl 3 :CH 3 OH (2:1). A 10-ul aliquot of each is spotted onto a HP-TLC plate (Merck)and 
developed in a solvent system composed of CHC1 3 :CH 3 0H:H 2 0 (60:40:9), 0.02% 

25 CaCl 2 -2H 2 0. The radioactive products were located by autoradiography. 

5.9. ADDITIONAL APPLICATIONS AND USES OF al~2FucT NUCLEIC 
ACIDS AND PROTEINS 

Provided below are additional non-limiting methods of using the al-2FucT 
^ nucleic acids and proteins of the invention. 

5.9.1. DETECTION OF ONCOGENESIS 

As cd-2FucT expression is often activated during oncogenic transformation 
(see Section 2.5 supra), oncogenic transformation of test tissues can be detected by assaying 
for changes in the expression of al -2FucT, for example by the methods described below. 
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Assays for changes in gene expression are well known in the art (see e.g. , 
PCT Publication No. WO 96/34099, published October 31, 1996, which is incorporated by 
reference herein in its entirety). 

In particular, the assays may detect the presence of increased or decreased 
expression of ccl-2FucT gene or protein on the basis of increased or decreased mRNA 
expression (using, e.g., nucleic acid probes), increased or decreased levels of related protein 
products (using, e.g., the antibodies disclosed herein), or increased or decreased levels of 
expression of the catalytic product of the al-2FucT gene (e.g. Fucosyl-GM,). 

10 5.9.2. GENE THERAPY 

As cd-2FucT expression is often activated in cell transformation, e.g. small 
cell lung carcinoma, down-regulation of al-2FucT expression, e.g. by anti-sense nucleic 
acids to al-2FucT coding sequences, may be used to inhibit, suppress or treat cancer (see, 
supra, Section 5.7). In one illustrative example, the anti-sense sequences are transduced in 
15 viral vectors, e.g. adenoviral or retroviral vectors. 

5.9.3. PREPARATIVE SYNTHESIS OF FUCOSYL-SACCHARIDE 
COMPOSITIONS 

The specificity of cd-2FucT of the invention lies in its recognition of the 
20 carbohydrate structure, Galp 1 - 3GalNAc, when found at the terminus of a molecule. While 
the enzyme is utilized in vivo to catalyze the addition of fucose in an a 1-2 linkage to the 
terminal galactose residue of the ganglioside GM,, in vitro it is used to catalyze the addition 
of fucose in an a 1-2 linkage to the terminal galactose residue of any molecule having a 
terminal Gaip 1 -* 3GalNAc moiety. Such molecules include glycoproteins, glycolipids, 
glycolipoproteins and oligo- or poly- saccharides. 

The al ~2FucT proteins of the invention may be used in any of numerous 
forms known in the art, e.g., as an isolated or purified protein in solution, in a cellular 
fraction of a cell population that expresses the ccl-2FucT proteins (see, supra, Section 5.2) 
or immobilized, for example on a substrate or planar surface or in liposomes, micelles, 
microparticles, or microcapsules, etc.. 

According to one embodiment of the present invention, the <xl-2FucT 
protein (or a catalytic derivative or analog thereof) can be used in the preparative synthesis 
of a molecule which comprises a Fucal-2 Gal(5l-3GalNAc moiety, said method 
35 comprising contacting isolated or purified rat al-2FucT of the invention with GDP-fucose 
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and a molecule having a terminal Gaipi-3GalNAc moiety for a time sufficient to permit the 
rat al-2FucT to transfer the fucose to said molecule and recovering a molecule which 
comprises Fucal-2 Gaipi ^3GalNAc. In one mode of this embodiment, the molecule 
having a terminal Gaipi-3GalNAc moiety is a glycolipid, a glycoprotein, a glycolipoprotein 

^ or an oligo- or polysaccharide. The oligo- or polysaccharide can be a free saccharide or can 
be an saccharide immobilized, for example by means of a linker moiety to a substrate or 
surface. A free saccharide having a Fuccd-2 Gaipi-3GaINAc moiety can be obtained by 
cleavage of the linker moiety. 

1Q According to an alternative embodiment of the present invention, a cell 

fraction having catalytic activity of al-2FucT protein (or a catalytic derivative or analog 
thereof) can be used in the preparative synthesis of a molecule which comprises a Fucal-2 
Gaipi~*3GalNAc moiety, said method comprising contacting a cell fraction having rat 
a 1 -2FucT of the invention with GDP-fucose and a molecule having a terminal 

1 5 Gaip 1 -3GalNAc moiety for a time sufficient to permit the rat a 1 -*2FucT to transfer the 
fucose to said molecule and recovering a molecule which comprises Fucal-2 
Gaip 1 - 3GalNAc. In one mode of this embodiment, the molecule having a terminal 
Gaipi-3GalNAc moiety is a glycolipid, a glycoprotein, a glycolipoprotein or an oligo- or 
polysaccharide. The oligo- or polysaccharide can be a free saccharide or can be a saccharide 

20 

immobilized, for example by means of a linker moiety to a substrate or surface. A free 
saccharide having a Fucal -2Gaipi ->3GalNAc moiety can be obtained by cleavage of the 
linker moiety. 

According to a specific embodiment, the cd-2FucT (or a catalytic derivative 
25 or analog thereof) is used for the preparative synthesis of fucosyl-GM,. In one non-limiting 
example of this specific embodiment, fucosyl-GM, is prepared as follows: a reaction 
mixture composed of 25 umol of HEPES buffer, pH 7.2, 10 umol of MnC12, 500 ug 
CHAPSO, 0.5 mg GM, is contacted with a crude cell homogenate of COS-7 cells transiently 
transfected with plasmid containing the rat al~2fucosyltransferase coding sequence. 
30 Progress of the reaction can be followed with time by withdrawing aliquots of the reaction 
mixture and spotting it on an HPTLC plate. The plate is then developed in a solvent system 
composed of CHCL 3 :CH 3 OH:H 2 0, 60:40:9, containing 0.02% CaCl 2 . Glycolipid bands are 
determined by orcinol spray. Fucosyl-GM, is recovered. 



5.9.3.1. USES OF SACCHARIDE COMPOSITIONS 
PRODUCED BY al-2FucT 
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The glycoproteins, glycolipids, glycolipoproteins or free oligo- or 
polysaccharides containing a Fucal-2 Galpl~*3GalNAc moiety produced by <xl-2FucT 
possess nutritional value, and as such may be used as food additives for e.g. infant formula 
5 or geriatric formula. 

5.9.3.2. USE OF FUCOSYL-GM, AS AN 

IMMUNOSUPPRESSIVE OR 
IMMUNOTHERAPEUTIC 

j q Fucosyl-GM, is a cell surface antigen present on a variety of tumors. Thus, 

fucosyl-GM, can serve as a vaccine when presented to the immune system of an individual 
with a tumor expressing this antigen by methods known to those skilled in the art. In one 
embodiment, fucosyl-GM, prepared is injected directly into the bloodstream of the - 
individual, where it will elicit an immune response, resulting in the production of antibodies 

15 by B-cells against fucosyl-GM„ which antibodies will recognize cells of the tumor. In 
another embodiment, dendritic cells are extracted from an individual (e.g. by fluorescent 
activated cell sorting (FACS) using an antibody against a cell surface antigen of dendritic 
cells as described in U.S. patent number 5,876,917). Preferably, the dendritic cells are 
induced to proliferate in vitro (e.g. by the method of U.S. patent number 5,851,756). The 

2Q dendritic cells, whether having been induced to proliferate in vitro or not, are then exposed 
to fucosyl-GM,. These cells engulf the fucosyl-GM, antigen and present it on their cell 
surfaces. After re-introducing the fucosyl-GM, -presenting cells into the patient, either into 
the bloodstream or locally at the site of the tumor, the cells will stimulate an immune 
response by activating T-cells, again resulting in the production of anti-tumor antibodies 

25 and/or a cytotoxic cellular T-cell immune response against the tumor. Alternatively, the 
dendritic cells exposed to fucosyl-GM, can be used in vitro to stimulate T-cells of the 
individual which T-cells can then be administered to the patient to afford a cellular immune 
response. 

The present invention is further illustrated by the following non-limiting 

examples. 

6. EXAMPLE: CLONING AND EXPRESSION OF THE CATALYTIC 
DOMAIN OF RAT HEPATOMA GDP-FUCOSE:GM , 
a 1 - 2FUCOS YLTRANSFERASE 
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This example illustrates the cloning and expression of the catalytic domain 
from rat hepatoma H35 cell GDP-fucose: GM, al -2fucosyltransferase, an enzyme which 
is activated during early stages of chemical carcinogenesis in rat liver. 

We have prepared primers based upon consensus sequences of highly 
conserved regions of the al— 2FucT gene and, using an RT-PCR approach, amplified a 
product from H35 cell total RNA. These results have indicated that H35 cells encode a 
novel enzyme, a portion of the 3' end of which has previously been cloned from rat colonic 
adenocarcinoma PROb cells (Piau, J.-P., et al, 1994, Biochem. J. 300:623-626). Using this 
10 information and additional primers from the more 5' end of the gene, we have cloned and 
expressed a 353 amino acid enzyme construct from H35 cell total RNA with 
al — 2fucosyltransferase enzyme activity. 

6.1. MATERIALS 

1 5 Rat hepatoma H35 cells and simian COS-7 cells were obtained from the 

American Type Cell Collection (Manassas, VA). RNAzol B total RNA isolation kit was 
obtained from Tel-Test, Inc. (Friendswood, TX). Plasmids pZErO-1 and pCR2.1-TOPO 
were from Invitrogen (San Diego, CA) and pPROTA was received from Dr. Bruce Macher 
(San Francisco State Univ., San Francisco, CA). Rabbit IgG-agarose beads and DEAE- 

20 dextran were obtained from Sigma (St. Louis, MO). PCR primers were made on a Beckman 
Oligo 1000 synthesizer. GDP-[ 14 C]fucose and [ a- 35 S]dATP were obtained from Dupont 
NEN (Boston, MA). Non-automated DNA sequencing was done using the Sequenase 
Version 2.0 DNA sequencing kit from United States Biochemical Corp. (Cleveland, OH) or 
the SequiTherm EXCEL II DNA sequencing kit from Epicentre Technologies (Madison, 

25 WI). All other reagents were of the highest quality commercially available. 

6.2. METHODS 

6.2.1. CELL CULTURE 

Rat hepatoma H35 cells and simian COS-7 cells were grown in tissue culture 
30 plates in Dulbecco's modified Eagle's medium (DME), supplemented with 10% fetal calf 
serum. The cells were harvested and passed 1 :4 every 5-6 days. 

6.2.2. RT-PCR ANALYSIS OF RAT HEPATOMA H35 CELLS: 
al— 2FUCOSYLTRANSFERASE 

35 
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Total RNA was extracted from approximately 1 x 10 7 rat hepatoma H35 
cells or from 300 mg of F344 whole liver tissue using the RNAzol B method (Tel-test, Inc.). 
The isolated RNA, in 10 mM Tris buffer, pH 7.5, was initially amplified by RT-PCR using 
the following primers: primer I (forward), 5 ' -GGCCGCTTTGGGAACC AGATGG-3 ' (22- 

5 mer) (SEQ ID NOT); primer II (reverse), 5'-GGTTACACTGCGTGAGCAGCGC-3'(22- 
mer)(SEQ ID NO:2). These primers were based upon the consensus of portions of human, 
rabbit and rat intestine al— 2FT coding sequences which have substantial sequence 
homology. The location of these primers in relation to DNA sequences of other al —2 FucT 

^ q enzymes is illustrated in Figure 1 . 

cDNA was made from (-1.75 pg total) RNA using random hexamers as 
primers for MuLV reverse transcriptase. Amplification was then conducted with AmpliTaq 
DNA polymerase using 200pM of each of the above primers in 35 cycles of 95°C for 30 s, 
58°C for 30 s, and 72°C for 1 min in a Coy thermocycler using a Gene Amp PCR kit 

j ^ (Perkin-Elmer, Branchburg, NJ) to obtain a PCR product of approximately 0.6-kb. 

Some DNA sequence was obtained using the Sequenase PCR product ' 
sequencing kit (USB/Amersham, Cleveland, OH) for direct sequencing of PCR products 
using the dideoxy chain termination method (Sanger, F., et al., 1977, Proc. Natl. Acad. Sci. 
USA). The ~0.6-kb rat PCR product was also cloned into the EcoRV site of pZErO plasmid 

2Q (Invitrogen) and sequenced using the Sequenase version 2.0 sequencing kit 

(USB/Amersham) in order to determine the sequence near the 5' and 3' ends of the product. 

Based upon sequencing results, which revealed 99% identity between 197 
nucleotides at the 3' end of the 0.6-kb PCR product and the 5' end of the rat al— 2FTB 
reported earlier (Piau, J.-P., et al., 1994, Biochem. J. 300:623-626), a second reverse primer 

25 was also made, which is homologous with the 3' end of the coding portion of the rat FTB 
gene with stop codon (shown in bold lettering below) and some 3' untranslated sequence: 
primer III (reverse), 5'-TTCCCATCAGAAGGCTCTTCCTGC-3' (SEQ IDNO:3). A 
second, more upstream forward primer was made based upon rabbit RFT-III, which was 
found to be the most homologous gene on the nucleotide level to our rat PCR product. This 

30 1 7-base-pair primer encompassed nucleotides 62-78 of rabbit RFT-III, within the region 
determined to be near the end of the hydrophobic transmembrane domain of the enzyme. 
Although sequence homology between differing al — 2FucT genes is considerably reduced 
in this region compared to more 3' sequences, this particular short sequence showed a 
reasonable degree of homology to aligned regions of rabbit RFT-II (nucleotides 71-87) and 
human Sec2 (nucleotides 29-45) genes as well (see Figure 1). This primer was as follows: 
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primer IV(forward), 5'-CCGCCTCCACCATCTTC-3' (SEQ ID NO:4). RT-PCR was 
conducted on total rat H35 cell RNA as described above to obtain a PCR product of 
approximately 1.1 kb which was cloned into pCR 2.1-TOPO vector and sequenced. A final 
forward primer was made reflecting exclusively the H35 <xl-*2FucT gene sequence and 
5 adaptors for cloning into the pPROTA fusion protein expression vector: primer V 

(forward), 5 ' - ATgaattcCCTCC AGC AGCG AAT A-3 ' (SEQ ID NO:5). An £coRI site 
(shown in lower case above) and an additional C residue (bold) were included in the 
forward primer for in frame cloning into pPROTA (Henion et al., 1994, Glycobiology 
4:193-202). 

10 

6.2.3. CONSTRUCTION OF A RAT al-»2FucT EXPRESSION 
VECTOR AND EXPRESSION OF RT-PCR cDNA 

RT-PCR was performed on rat H35 cell total RNA using primer 

combinations III (SEQ ID NO:3) and V (SEQ ID NO:5), as described above, to obtain a 

L077-kb PCR product, which was then subcloned into pCR 2.1-TOPO. The insert was 

excised with EcoRl and subsequently cloned into the £coRI site of pPROTA plasmid for the 

production of the Protein A-IgG binding domain/ rat H35 cell ccl-2FucT fusion protein 

(Henion, T.R., et al., 1994, Glycobiology 4:193-202). Correct orientation of the PCR insert 

20 was established by Hindlll/ Stul digestion and the resultant construct named CAT-RFT- 
pPROTA. CAT-RFT-pPROTA was transiently transfected into COS-7 cells by the DEAE- 
dextran method (Ausubel, F.M., et al., 1993, Current Protocols in Molecular Biology, 
Wiley, New York). Secreted fusion protein was purified from the conditioned medium of 
cells after 4-5 days on IgG-agarose beads as previously described (Holmes, E.H., et al., 

25 1995, J. Biol. Chem. 270:8145-8151) for the assay of al-2FucT expression. 

6.2.4. od-»2FUCOSYLTRANSFERASE ASSAYS 

al -*2Fucosyltransferase activity was determined in reaction mixtures 
30 containing 2.5 umol of HEPES buffer, pH7.2, 30 ug of GM, ganglioside or nLc 4 , 250 ug 
phosphatidylglycerol, 1 umol of MnCl 2 , 0.5 umol of CDP-choline, 15 nmol of GDP- 
[ 14 C]fucose (15,000 cpm/nmol), and pPROTA-expressed enzyme bound to IgG-agarose 
beads in a total volume of 0.1 ml. The reaction mixtures were incubated for 2 h at 37°C, 
terminated by the addition of 0.1 ml of CHCl 3 :CH 3 OH (2:1), and streaked onto a 4-cm-wide 
35 strip of Whatman 3 paper and developed with water overnight. The papers were dried and 
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the labeled product extracted from the origins with 2- to 5-ml washes of 
CHCi 3 :CH 3 OH:H 2 0 (10:5:1). The combined eluates were concentrated to dryness by an N 2 
stream and dissolved in 20 ul of CHCl 3 :CH 3 OH (2: 1). A 10-ul aliquot of each was spotted 
onto a HP-TLC plate (Merck)and developed in a solvent system composed of 
5 CHC1 3 :CH 3 0H:H 2 0 (60:40:9), 0.02% CaCl 2 _H 2 0. The radioactive products were located 
by autoradiography. 

6.3. RESULTS 

6.3.1. RT-PCR ANALYSIS OF ccl-2FucT EXPRESSION IN RAT 
10 HEPATOMA H35 CELLS 

A survey of aligned nucleotide sequences for human and rabbit al~»2FucT 
enzyme genes indicates areas where very high sequence homology exists between all forms. 
Portions of these aligned sequences are shown in Figure 1. Two of these regions were 
selected for PCR primer design and initial RT-PCR amplification of H35 cell total RNA. 
The location of these regions (designated primers I and II; SEQ ID NO.s: 1 and 2), 
corresponding to nucleotides 220 to 241 and 838 to 859 of the rabbit RFT-III for 
comparison, are also shown in Figure 1 . 

A single PCR product slightly over 0.6-kb in size was obtained using primers 

2Q I (SEQ ID NO:l) and II (SEQ ID NO:2) (Figure 2, lane 1), which corresponds to the 

expected fragment size based upon location of these primer regions in the gene. Sequencing 
on both strands revealed a run of 597 unambiguous nucleotides between the two primer 
sequences, which were compared to rabbit and rat al — 2FTs. Up to 84% homology in 
nucleotide sequence was detected between this rat PCR product and the rabbit gene(s). The 

25 last 1 97 nucleotides at the 3 ' of the PCR product were found to have 99% identity with the 
5' end of the rat al-2FTB fragment reported earlier (Piau, J.-P., et al., 1994, Biochem. J. 
300:623-626). The difference (GTG) was detected at the codon for amino acid 50 encoded 
by the rat FTB fragment (GGT) and was confirmed on two PCR clones with 3 different 
primers. This represents an amino acid change of glycine in FTB to valine in the H35 cell 

30 al -*2FucT at that site. No RT-PCR product was obtained from H35 cell total RNA 

corresponding to rat FTA (Piau, J.-P., et al., 1994, Biochem. J. 300:623-626) using rat FTA 
primers (results not shown). 

To verify that sequences from the rat FTB fragment constituted the 3' region 

^ of the H35 cell al -»2FucT gene, a second RT-PCR experiment was performed using 

primers I (forward) (SEQ ID NOT) and III (reverse) (SEQ ID NO:3) (see, supra, Sections 
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6. 1 and 6.2). These primers reflect the start site used in generating the first PCR product 
through the end of the gene based upon the FTB sequence. As shown in Figure 2, lane 2, a 
product, approximately 0.9 kb in size, was obtained from rat H35 total RNA. This PCR 
product was sequenced and confirmed that rat FTB most probably corresponds to the 3 ' 

5 portion of this gene. 

In general, mammalian membrane-bound glycosyltransferases are composed 
of a short intracellular N-terminal domain, a transmembrane domain, and an extracellular 
stem region and C-terminal catalytic domain. The stem region corresponds to portions of 
the extracellular domain which can be removed and are not required for catalytic activity. 

10 Generally, most sequence homology among ocl -2FucT enzymes occur in the catalytic 

domain with much lower homology found in DNA sequences corresponding to the more N- 
terminal portion of the protein. To obtain an RT-PCR product from H35 cell total RNA 
containing sequences for as much of the N-terminal of the protein as possible to ensure an 
active enzyme would later be expressed, a forward primer (primer IV; SEQ ID NO:4) 
corresponding to portions of the transmembrane domain of rabbit RFT-III (nucleotides 62 to 
78) where reasonable sequence homology exists between enzymes was used in combination 
with primer III (SEQ ID NO:3). The results (not shown) indicated that a PCR product of 
approximately 1.1 kb was generated. Sequencing of this product confirmed that it contained 
the same sequence obtained in the earlier RT-PCR experiments and included an additional 

20 

181 nucleotides of rat H35 cell cd^2FucT sequence at the 5' end. 

To obtain a cDNA containing only confirmed rat al-2FucT sequences for 
insertion into the £coRl site of the pPROTA expression vector, a forward primer (primer 
V; SEQ ID NO:5) was used in combination with primer III (SEQ ID NO:3) in an RT-PCR 

25 experiment. Primer V (SEQ ID NO:5) corresponded to the most 5' end of the confirmed 
rat sequence and contained an adaptor for £coRl cloning and an extra C residue for in- 
frame cloning into the pPROTA vector. A product of approximately 1 . 1 kb (1068 
nucleotides of confirmed rat H35 cell cd^2FucT sequence) was amplified from rat H35 cell 

^ total RNA using primers V and III (Figure 2, lane 3). This product represents the majority of 
the rat H35 cell ccl-2FucT, but is missing the start of the coding sequence encoding 
intracellular and transmembrane domains of the protein. This PCR product was fully 
sequenced (Figure 3 A) and determined to encode the cd -*2FucT associated with malignant 
transformation in rat liver cells. 
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Sequence analysis using the BLAST algorithm (Altschul et al., 1990, J. Mol. 
Biol. 215(3):403-10) determined that the observed sequence is highly homologous to the 
sequences of all presently known al-2FucT coding sequences from human, rabbit and rat. 
It is also virtually identical to the 5' 480 nucleotides of the fragment from rat FTB isolated 

5 by Piau et al. (1994, Biochem. J. 300:623-626), which encodes a polypeptide comprising 
approximately half of the ocl-*2FucT catalytic domain and possessing no catalytic activity. 
The sequence shown in Figure 3 A codes for 353 amino acids and contains four potential N- 
linked glycosylation sites. Table I shows the comparative extent of nucleotide and deduced 

1 0 am i no ac id sequence from all known enzyme forms. As indicated, high homology was 
detected between the amino acid sequences of the human Sec2 enzyme and the rat H35 cell 
al-2FucT enzyme at 77%. Figure 3B shows an aligned deduced amino acid sequence 
comparison between these two enzymes. Rabbit RFT-II and RFT-III enzymes also show a 
high degree of homology at 71% and 68%, respectively, and rat FTA fragment at 70%. 

1 5 There is far less sequence homology between the rat H35 cell a 1 -+2FucT and the human H 
and the rabbit RFT-I enzymes. Thus, this new rat enzyme appears to be more closely related 
to the secretor enzyme than the H enzyme; This is consistent with published results (Larsen, 
R.D., et al., 1990, Proc. Natl. Acad. Sci. USA 87:6674-6678; Kelly, R.J., et al., 1995, J. 
Biol. Chem. 270:4640-4649; Hitoshi, S., et al., 1995, J. Biol. Chem. 270:8844-8850; 

20 

Hitoshi, S., et al., 1996, J. Biol. Chem. 271:16975-16981) which show a proportionally 
higher specificity for GMj acceptors compared to lacto- or neolacto-series acceptors for 
secretor enzyme-like forms compared to the H enzyme. 



25 



30 



35 
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Table I 



Comparison of Percent Homology of the Catalytic Domain of Rat Hepatoma H35 Cell 
<xl-2FucT with Other Cloned al-2FucT Enzyme Sequences 



% Homology 1 based on 



xt. „i a* j c Deduced Amino Acid 
Nucleotide Sequence 

Enz y me Sequence 

Human H 62 58 

10 Human Sec2 73 77 

Human Seel 69 66 

Rabbit FT-I 64 59 

Rabbit FT-II 71 71 

15 Rabbit FT-HI 75 68 

Rat FTA 69 70 

Rat FTB 99 99 

' The percentages in the table reflect the degree of homology in the nucleotide stretches 
representing the overlap between rat GM,-specific al-2FucT and the other txl-2FucT 
sequences. Sequences outside the overlap are not taken into consideration when calculating the 
20 homology percentages. 



6.3.2. ANALYSIS OF pPROTA-EXPRESSED H35 CELL al-2FucT 
ACTIVITY 

Expression of CAT-RFT- pPROTA results in the production of a fusion 
25 protein composed of the protein A-IgG-binding domain and the al -2FucT sequence (SEQ 
ID NO: 10) shown in Figure 3 A. The expressed protein is conveniently isolated by binding 
to IgG-agarose beads which can be directly assayed for enzyme activity. As shown in 
Figure 4, lane 1, the expressed H35 cell al -2FucT was found to transfer fucose to GM,. 
3Q No detectable transfer was observed to the neolacto-series acceptor nLcOse 4 Cer (lane 3), 
whose carbohydrate moiety is characterized by a terminal Gaipi-4GlcNAcpl saccharide. 
Further, no transfer to GM, was observed with beaded enzyme obtained after inserting the 
H35 cell al ^2FucT cDNA into pPROTA in the reverse orientation (lane 2). 

35 

6.4. DISCUSSION 
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Aligned sequences of human and rabbit al-2FucT's demonstrate 
considerable homology in regions corresponding to the catalytic domain of the enzyme. 
According to the present invention, an RT-PCR cloning strategy utilizing primers 
corresponding to consensus sequences between these genes was successful in amplifying the 

5 appropriate coding sequence from rat H35 cell total RNA. The results demonstrate that this 
approach provided a significant portion of the H35 cell cd -2FucT sequence. The initial 
PCR sequence which was illustrated in Figure 3 A overlapped with that from the previously 
published rat FTB fragment (Piau, J.-P., et al., 1994, Biochem. J. 300:623-626). The rat 

1 0 FTB sequence, when placed in tandem with our upstream sequence, yielded a coding 

sequence for 292 amino acids and a stop codon. Subsequent use of a primer encompassing 
the sequence surrounding this stop codon, as well as another encoding a portion of the 
transmembrane domain of rabbit RFT-III, yielded a cDNA encoding the extracellular - 
portion of the rat H35 cell al - 2FucT. The cDNA corresponding to confirmed rat ' 

15 al - 2FucT sequences when expressed in the pPROTA vector yielded a protein A-IgG- 
binding domain fusion protein with GM,-specific al - > 2FucT activity. 

The observed cDNA sequence of the H35 cell al~*2FucT was found to be 
distinct yet highly homologous to relevant portions of the genes from other species (Larsen, 

20 R.D., et al., 1990, Proc. Natl. Acad. Sci. USA 87:6674-6678; Kelly, R.J., et al., 1995, J. 
Biol. Chem. 270:4640-4649; Hitoshi, S., et al., 1995, J. Biol. Chem. 270:8844-8850; 
Hitoshi, S., et al., 1996, J. Biol. Chem. 271:16975-16981; Piau, J.-P., et al, 1994, Biochem. 
J. 300:623-626). As indicated above, the H35 cell al^2FucT cDNA obtained is missing 5' 
regions of the gene encoding the intracellular and transmembrane domains of the enzyme, 
corresponding to an estimated 15 to 30 amino acids from the N-terminal of the protein 
based upon sequence alignments with cloned full length proteins. In general, this region has 
a lower degree of sequence homology in comparison with other al -2FucT's. Thus, the 
degree of homology contained within only the extracellular domain may be slightly higher 

30 than if the entire coding sequences are compared. 

The results indicate that repeated RT-PCR experiments with several primers 
provided cDNA products with clear, unambiguous, and identical sequences. There was no 
evidence suggesting multiple PCR products were generated with any primer combination 
used. In particular, no sequence corresponding to the rat FTA gene (Piau, J.-P., et al., 1994, 

35 Biochem. J. 300:623-626) was obtained, even when primers specific for FTA were used. 
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Thus, rat hepatoma H35 cells most probably express only a single cd -*2FucT enzyme, one 
with very high specificity for ganglio-series acceptors. 

7. EXAMPLE: CLONING AND EXPRESSION OF FULL LENGTH GDP- 
5 FUCOSE:GM , al-2FucT 

7.1. CLONING 

We have cloned the entire coding region of the rat cd-2FucT gene. Based 
upon information obtained from a 2984 bp Rattus norvegicus FTB mRNA sequence found 
in GenBank databases (Koda, Y., Submitted to the DDBJ/EMBL/GenBank databases, 1997, 

1 0 Accession #AB006 1 3 8), a forward primer was designed from the putative start of 

translation, determined by the rules of Kozak (Kozak, M, 1992, Ann. Rev. Cell Biol. 8:197- 
225). This mRNA was found to contain 213 nucleotides of upstream untranslated sequence 
and over 1580 bp of 3' untranslated sequence. As it is reported in the GenBank database, 
this particular sequence has an error at amino acid 354 (A-H) immediately followed by a 
nonsense mutation (TAA) for premature termination and therefore, would not express an 
active enzyme. In addition to the amino acid 354 (A-H) change, this sequence (as reported) 
is missing 26 crucial amino acids at the C terminus. These are: 
LTPACPRSHFHLKAKGVTCYVAGRAF (amino acids 355-380 of SEQ ID NO:8). 

20 However, based on correct sequence information at the 5' end, a new forward primer was 
designed: 

Primer VI: 5' GCCATGGCCAGCGCCCAGGTTCCT 3' (SEQ ID NO:6). This primer 
was used in conjunction with Primer III (SEQ ID NO:3), described above, to RT-PCR the 
entire 1 140 bp al-2FucT coding region (results not shown). This PCR product was 

25 sequenced (Figure 5) and determined to be the nucleotide sequence encoding the cd -2FucT 
associated with malignant transformation in rat liver cells. Table II shows the comparative 
extent of nucleotide and deduced amino acid sequence from all known enzyme forms. Full 
length od -2FucT cDNA was then cloned into pcDNA 3 vector (Invitrogen) in both the 
positive (FL-RFT-pcDNA3) and negative (FL-RFT(-)-pcDNA3) orientations for later 

30 cd-2FucT enzyme assays. 
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Table II 

Comparison of Percent Homology of the Full Length Rat Hepatoma H35 Cell 
ccl-»2FucT with Other Cloned cd->2FucT Enzyme Sequences 

% Homology 1 based on 



x f , ... c Deduced Amino Acid 

Nucleotide Sequence 
Enz y me Sequence 



Human H 


62 


55 


Human Sec2 


74 


77 


Human Seel 


70 


63 


Rabbit FT-I 


64 


56 


Rabbit FT-II 


71 


67 


Rabbit FT-III 


75 


65 


Rat FTA 


69 


70 


Rat FTB 


99 


99 



The percentages in the table reflect the degree of homology in the nucleotide stretches 
representing the overlap between rat GM,-specific al-2FucT and the other al-2FucT 
sequences. Sequences outside the overlap are not taken into consideration when calculating the 
homology percentages. 



7.2. ANALYSIS OF EXPRESSED FULL LENGTH <xl-»2FucT ACTIVITY 

25 Expression of full length a 1 -2FucT cDNA in transfected cells results in 

membrane-bound enzyme. COS-7 cells were transiently transfected by the DEAE-dextran 
method (Ausubel, F.M., et al., Current Protocols in Molecular Biology, Wiley, New York) 
with either FL-RFT-pcDNA3 or FL-RFT(-)-pcDNA3. Four to five days later, the cells were 
harvested, sonicated in HEPES, glycerol, DTE buffer and assayed for enzyme activity as 

30 described previously (Sherwood, A.L., et al, 1998, Arch. Biochem. Biophys. 355:215-221). 

As shown in Figure 6, the expressed recombinant full length enzyme 
transfers fucose to GM, with high efficiency; much higher than the pPROTA-expressed 
truncated enzyme. Comparable transfer was observed in the presence or absence of 
CHAPSO detergent (100 ug); (lanes D and A, respectively). Somewhat less transfer to GM, 

35 was observed in the presence of 250 ug of phosphatidylglycerol (lane B) and significantly 
less transfer was observed in the presence of both phosphatiylglycerol (250 fig) and 
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G3634A detergent (100 fig) (Lane C). No fucose transfer was observed under any of these 
conditions in homogenates from COS-7 cells transfected with FL-RFT(-)-pcDNA3 (results 
not shown). 

5 8. EXAMPLE: RT-PCR OF al-2FucT IN RAT LIVER AFTER 
ADMINISTRATION OF A CARCINOGEN 

Expression of al-2FucT in F344 rat liver before and after administration of 
the carcinogen 0.03% N-2-acetylaminofluorene (AAF) in the diet (Holmes, E.H., 1990, 
Carcinogenesis 1 1 : 89-94) was tested by RT-PCR of total RNA using primers I and II and 

10 compared to the results with H35 cell total RNA. Total RNA was extracted from 

approximately 200 mg of normal, healthy Fisher 344 rat liver tissue and 200 mg of liver 
tissue from rats fed a diet containing 0.03% AAF for > 3 weeks using the RNAzol B method 
(Tel-test, Inc., Friendswood, TX). RT-PCR was conducted as described (Sherwood, AX., et 
al., 1998, Arch. Biochem. Biophys. 355:215-221), with 200 pM of primers I and II which 

1 5 have been found to reproducibly yield a single PCR product of approximately 0.6-kb. This 
represents a portion of the GDP-fucose:GM, specific al-2FucT present in rat hepatoma 
H35 cells. Results are shown in Figure 7. 

As shown in Figure 7, an approximately 0.6-kb product corresponding to that 
derived from H35 cell total RNA (lane 1) was obtained with total RNA derived from liver 
20 after 0.03% AAF feeding (lane 3). No PCR product was obtained in the same experiment 
from total RNA isolated from normal F344 liver (lane 2). The AAF-fed rat liver sample 
chosen for this study was one of several which displayed a moderate level of cd-*2FucT 
enzyme activity following a feeding regimen of AAF carcinogen. An identical PCR product 
was also obtained in a later experiment using a second AAF-fed rat liver sample, which had 
25 previously been found to display low to moderate cd-2FucT enzyme activity (results not 
shown). No al-2FucT enzyme activity has ever been detected in normal liver tissue from 
rats fed a standard diet lacking AAF. The results presented in Figure 7 clearly demonstrate 
that mRNA encoding the al-2FucT gene is not expressed in normal F344 rat liver tissue 
but is present in liver tissue after administration of 0.03% AAF. The observation that both 

0 enzyme activity and mRNA specific for al-2FucT is present after only three or more weeks 
of exposure to AAF confirms that this enzyme is induced in the early stages of chemical 
carcinogenesis in rat liver. 

The observation of the induction of synthesis of this enzyme during very 
early stages of chemical carcinogenesis suggests that it is an interesting marker for studying 

5 this process in vivo. Results presented confirm that mRNA encoding the al - 2FucT gene is 
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not expressed in normal F344 rat liver tissue but is present in liver tissue after 
administration of 0.03% iV-2-acetylaminofluorene. 

9. EXAMPLE: INHIBITION OF a 1 -2FucT ACTIVITY BY ANTISENSE 
TREATMENT 

The ability of antisense cd -2FucT nucleotides to inhibit al ^2FucT activity 
was assessed in COS-7 cells in which a constant "dose"(l ug) of FL-RFT-pcDNA3 sense 
cDNA was transiently transfected with increasing "doses"(l, 2, 3 and 5 fig) of FL-RFT(-)- 
pcDNA3 antisense cDNA and varying amounts of pcDNA3 vector (no insert) in each case 
to maintain equi-molar ratios of total plasmid transfected into cells under each condition. 
Four to five days later, COS-7 cells were harvested, sonicated in HEPES, glycerol, DTE 
buffer and assayed for ccl-2FucT activity as previously described (Sherwood, A.L., et al., 
1998, Arch. Biochem. Biophys. 355:215-221). A progressive decrease in enzyme activity 
was observed with increasing concentrations of antisense al-2FucT cDNA (Figure 8). We 
chose to initially test this system in COS-7 cells because we have had consistently excellent 
(and rapid) results expressing constructs of various human al-3fucosyltransferase genes (as 
well as rat ocl-2FucT constructs) in this line. We currently have FL-RFT(-)-pcDNA3 stably 
transfected H35 hepatoma cells undergoing selection in G418 medium. Our results 
demonstrate a highly effective antisense treatment system for the down-regulation of rat 
ccl^2FucT. 



10. EXAMPLE: PREPARATIVE IN VITRO BIOSYNTHESIS OF FUCOSYL- 
GM, UTILIZING RECOMBINANT RAT al-2FUCOSYLTRANSFERASE 

Preparative biosynthesis of fucosyl-GM, was conducted in reaction mixtures 
composed of 25 umol of HEPES buffer, pH 7.2, 10 umol of MnC12, 500 ug CHAPSO, 0.5 
mg GM„ and 2 mg crude cell homogenate of COS-7 cells transiently transfected with 
plasmid containing the entire rat al-2fucosyltransferase coding sequence in a total volume 
of 0.5 ml. Progress of the reaction was followed with time by withdrawing 2 ul of the 
reaction mixture and spotting it on an HPTLC plate. The plate was developed in a solvent 
system composed of CHCL 3 :CH 3 OH:H 2 0, 60:40:9, containing 0.02% CaCl 2 . Glycolipid 
bands were determined by orcinol spray (Figure 9). 

The results demonstrate the appearance of increasing amounts of a slower 
migrating band corresponding to fucosyl-GM, from transfer of fucose in the cd-2-linkage 
to the added GM, acceptor with time. The enzyme is very active yielding almost complete 
conversion to fucosyl-GM, after 24 to 48 hours. This preparative biosynthesis can be scaled 
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appropriately to provide any amount of fucosyl-GM, product needed and is advantageously 
useful for commercial scale production of fucosyl-GM,. 

11. DEPOSIT OF MICROORGANISMS 

5 The following microorganisms were deposited with the American Type 

Culture Collection (ATCC), 10801 University Blvd., Manassas, VA on April 22, 1999 and 
have been assigned accession numbers as indicated. 

Microorganism Accession Number 

io " 

CAT-RFT-pPROTA in E. coli INVa — 
FL-RFT-pcDNA3 in E. coli DH5a — 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the foregoing 
description and accompanying drawings. Such modifications are intended to fall within the 
scope of the appended claims. 

Various references are cited herein above, including patent applications, 
patents, and publications, the disclosures of which are hereby incorporated by reference in 
their entireties. 
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